Torleiv Klove 


CODES FOR 
ERROR DETECTION 





CODES FOR 
ERROR DETECTION 


Series on Coding Theory and Cryptology 


Editors: Harald Niederreiter (National University of Singapore, Singapore) and 
San Ling (Nanyang Technological University, Singapore) 





Published 


Vol. 1 Basics of Contemporary Cryptography for IT Practitioners 
B. Ryabko and A. Fionov 


Vol. 2 Codes for Error Detection 
by T. Kløve 


Series on Coding Theory and Cryptology —Vol. 2 





CODES FOR 
ERROR DETECTION 


Torleiv Kigve 


University of Bergen, Norway 


w World Scientific 


NEW JERSEY * LONDON eœ SINGAPORE + BEIJING e SHANGHAI e HONG KONG e TAIPEI e CHENNAI 


Published by 


World Scientific Publishing Co. Pte. Ltd. 

5 Toh Tuck Link, Singapore 596224 

USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 
UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE 


British Library Cataloguing-in-Publication Data 
A catalogue record for this book is available from the British Library. 


Series on Coding Theory and Cryptology — Vol. 2 
CODES FOR ERROR DETECTION 


Copyright © 2007 by World Scientific Publishing Co. Pte. Ltd. 


All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, 
electronic or mechanical, including photocopying, recording or any information storage and retrieval 
system now known or to be invented, without written permission from the Publisher. 


For photocopying of material in this volume, please pay a copying fee through the Copyright 
Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to 
photocopy is not required from the publisher. 


ISBN-13 978-98 1-270-586-0 
ISBN-10 981-270-586-4 


Printed in Singapore. 


S.D.G. 


This page intentionally left blank 


Preface 


There are two basic methods of error control for communication, both in- 
volving coding of the messages. The differences lay in the way the codes 
are utilized. The codes used are block codes, which are the ones treated 
in this book, or convolutional codes. Often the block codes used are linear 
codes. With forward error correction, the codes are used to detect and cor- 
rect errors. In a repeat request system, the codes are used to detect errors, 
and, if there are errors, request a retransmission. 

Usually it is a much more complex task to correct errors than merely 
detect them. Detecting errors has the same complexity as encoding, which 
is usually linear in the length of the codewords. Optimal error correcting 
decoding is in general an NP-hard problem, and efficient decoding algo- 
rithms are known only for some classes of codes. This has generated much 
research into finding new classes of codes with efficient decoding as well as 
new decoding algorithms for known classes of codes. 

There are a number of books on error control, some are listed in the 
bibliography at the end of this book. The main theme of almost all these 
books is error correcting codes. Error detection tends to be looked upon 
as trivial and is covered in a few pages at most. What is then the reason 
behind the following book which is totally devoted to error detecting codes? 
The reason is, on the one hand, that error detection combined with repeat 
requests is a widely used method of error control, and on the other hand, 
that the analysis of the reliability of the information transmission system 
with error detection is usually quite complex. Moreover, the methods of 
analysis are often not sufficiently known, and simple rules of the thumb are 
used instead. 

The main parameter of a code for error detection is its probability of 
error detection, and this is the main theme of this book. There are many 
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papers devoted to the study of the probability of undetected error, both 
for symmetric channels and other channels, with or without memory. They 
are spread over many journals and conference proceedings. In Kløve and 
Korzhik (1995), which was published twelve years ago, we collected the re- 
sults then known, mainly for linear codes on the binary symmetric channel, 
and presented them in a unified form. We also included a number of new 
results. In the last twelve years a number of significant new results have 
been published (in approximately one hundred new papers). In the present 
book, we have included all the important new results, and we also include 
some new unpublished results. As far as possible, the results are given 
for both linear and non-linear codes, and for general alphabet size. Many 
results previously published for binary codes or for linear codes (or both) 
are generalized. 

We have mainly restricted ourselves to channels without memory and to 
codes for error detection only (not combined error correction and detection 
since these results belong mainly with error correcting codes; the topic is 
briefly mentioned, however). 

Chapter 1 is a short introduction to coding theory, concentrating on 
topics that are relevant for error detection. In particular, we give a more 
detailed presentation of the distance distribution of codes than what is 
common in books on error correction. 

Chapter 2 is the largest chapter, and it contains a detailed account of the 
known results on the probability of undetected error on the q-ary symmetric 
channel. Combined detection and correction will be briefly mentioned from 
the error detection point of view. 

Chapter 3 presents results that are particular for the binary symmetric 
channel. 

Chapter 4 considers codes for some other channels. 

Each chapter includes a list of comments and references. 

Finally, we give a bibliography of papers on error detection and related 
topics. 


The required background for the reader of this book will be some ba- 
sic knowledge of coding theory and some basic mathematical knowledge: 
algebra (matrices, groups, finite fields, vector spaces, polynomials) and el- 
ementary probability theory. 


Bergen, January 2007 
T. Kløve 
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Chapter 1 


Basics on error control 


1.1 ABC on codes 


1.1.1 Basic notations and terminology 


The basic idea of coding is to introduce redundancy that can be utilized to 
detect and, for some applications, correct errors that have occurred during 
a transmission over a channel. Here ” transmission” is used in a wide sense, 
including any process which may corrupt the data, e.g. transmission, stor- 
age, etc. The symbols transmitted are from some finite alphabet F. If the 
alphabet has size q we will sometimes denote it by F}. We mainly consider 
channels without memory, that is, a symbol a € F is transformed to b € F 
with some probability z(a,b), independent of other symbols transmitted 
(earlier or later). Since the channel is described by the transition prob- 
abilities and a change of alphabet is just a renaming of the symbols, the 
actual alphabet is not important. However, many code constructions utilize 
a structure of the alphabet. We will usually assume that the alphabet of 
size q is the set Z, of integers modulo q. When q is a prime power, we will 
sometimes use the finite field GF (q) as alphabet. The main reason is that 
vector spaces over finite fields are important codes; they are called linear 
codes. 

As usual, F” denotes the set of n-tuples (a1, a2,:-- ,@n) where a; E€ F. 
The n-tuples will also be called vectors. 

Suppose that we have a set M of M possible messages that may be sent. 
An (n, M; q) code is a subset of F” containing M vectors. An encoding is a 
one-to-one function from M to the code. The vectors of the code are called 
code words. 
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1.1.2 Hamming weight and distance 


The Hamming weight w(x) of a vector x is the number of non-zero posi- 
tions in x, that is 
wy(x) = #{i|1<i< n and z; #0}. 

The Hamming distance dy(x,y) between two vectors x,y € F is the 
number of positions where they differ, that is 

dy(x,y) = #{i|1<i<nand z; Z yi} 
If a vector x was transmitted and e errors occurred during transmission, 
then the received vector y differs from x in e positions, that is dy(x,y) = e. 

Clearly, 
du(x,y) = wu(x — y). 
For an (n, M;q) code C, define the minimum distance by 
d= d(C) = min{dy(x,y)|x,y eC, xy}, 
and let 
d(n, M; q) = max{d(C) | C is an (n, M; q) code}. 

Sometimes we include d = d(C) in the notation for a code and write 
(n, M, d; q) and |[n,k,d;q]. The rate of a code C C Fẹ is 


R= Berte 
n 
Define z 
in mo) = Seles) 
and 


(R; q) = lim sup ô(n, R; q). 


n— Ooo 


1.1.3 Support of a set of vectors 


For x = (£1, £2,..., £n) Y = (Y1, Y2,---;Yn) E F”, we define the support of 
the pair (x,y) by 
x(x, y) = {i € {E2 -a n} | Ti = yi}. 

Note that 

#X(X, y) = dy (x, y). 
For a single vector x € F”, the support is defined by x(x) = x(x, 0). 

For a set S C F”, we define its support by 
x(S)= U xy). 
xyes 

In particular, x(S) is the set of positions where not all vectors in S are 


equal. 


Basics on error control 3 


1.1.4 Extending vectors 


Let x = (£1, £2, ,Un) E F”, y = (y1,Y2,°°+ Ym) E F™ and u E F. 
Then 


UX = (ua, UT2, `` ,ULn), 
(x|u) = (£1, £2, a Pay U), 


(xly) E (£1, £2, st s n, Y1, Y2; Ym). 
The last two operations are called concatenation. For a subset S of F”, 


uS = {ux |x € S}, 


(Slu) = {(xlu) |x € 5}. 


1.1.5 Ordering 


Let some ordering < of F be given. We extend this to a partial ordering of 
F” as follows: 


(£1, £2, , an) < (Y1, Y2; Yn) if ri < yi fr1<i<n. 


For Z4 we use the natural ordering 0 < 1< 2... <q-1. 


1.1.6 Entropy 


The base q (or q-ary) entropy function H,(z) is defined by 


H,(z) = -zlo8,(—>) — (1 — z) log, (1 — 2) 


for 0 < z < 1. H,(z) is an increasing function on lo, I , H,(0) = 0, and 
Ha (=) = 1. Define p(z) = pq(z) on [0,1] by p(z) € [o, 1 and 


Ap (po(z)) =1- 2z. 


1.1.7 Systematic codes 
An (n,q";q) code C is called systematic if it has the form 
C= {(xlf(x)) |x € FF} 


where f is a mapping from F} to ee, Here (x|f(x)) denotes the con- 
catenation of x and f(x). 
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1.1.8 Equivalent codes 


Two (n, M;q) codes C1, C2 are equivalent if Cp can be obtained from Ci 
by permuting the positions of all code words by the same permutation. 
We note that equivalent codes have the same distance distribution, and in 
particular the same minimum distance. 


1.1.9 New codes from old 


There are a number of ways to construct new codes from one or more old 
ones. We will describe some of these briefly. In a later section we will 
discuss how the error detecting capability of the new codes are related to 
the error detecting capability of the old ones. 


Extending a code 


Consider an (n, M; q) code C. Let b = (bi,b2,--- ,bn) € FẸ. Let C™ be 
the (n + 1, M; q) code 


C {lar az, „an, — X aibi) | (a1,a2,'** ,an) € o}. 
i=1 


Note that this construction depends on the algebraic structure of the al- 
phabet F, (addition and multiplication are used to define the last term). 
For example, let n = 2, b = (1,1), and a = (1,1). If the alphabet is GF(4), 
then a,b; + a2b2 = 0, but if the alphabet is Z4, then a,b; + a2b2 = 2 Æ 0. 


Puncturing a code 


Consider an (n, M; q) code. Puncturing is to remove the first position from 
each code word (puncturing can also be done in any other position). This 
produces a code CP of length n — 1. If two code words in C are identical, 
except in the first position, then the punctured code words are the same. 
Hence the size of CP may be less than M. On the other hand, any code 
word c € CP is obtained from a vector (alc) where a € Fy. Hence, the 
size of CP is at least M/q. The minimum distance may decrease by one. 
Clearly, the operation of puncturing may be repeated. 


Basics on error control 5 


Shortening a code 
Consider an (n, M;q) code C with the first position in its support. Short- 
ening (by the first position) we obtain the (n — 1, M’; q) code 
C= {x Epa | (O|x) € c}, 

that is, we take the set of all code words of C with 0 in the first position 
and remove that position. More general, we can shorten by any position in 
the support of the code. 

We note that shortening will not decrease the minimum distance; how- 


ever it may increase it. In the extreme case, when there are no code words 
in C with 0 in the first position, C5 is empty. 


Zero-sum subcodes of a code 


Consider an (n, M; q) code C. The zero-sum subcode C” is the code 


O= {laraz an) EC | S =o}. 
i=l 


Also this construction depends on the algebraic structure of the alphabet. 
In the binary case, )7;_, a; = 0 if and only if wy(a) is even, and C” is 
then called the even-weight subcode. 


1.1.10 Cyclic codes 
A code C C F” is called cyclic if 


(@n—1,4n—2,°** ,@0) € C implies that (an_2,a@n_—3,--+ , ao, @n—1) € C. 


Our reason for the special way of indexing the elements is that we want to 
associate a polynomial in the variable z with each n-tuple as follows: 


a = (Gn—1,@n—2,..+,09) + a(z) = G12” * + an—22"7? +++ + a. 


This correspondence has the following property (it is an isomorphism): if 
a,b € F” and c E€ F, then 


a+b a(z) + 0(z), 
ca © ca(z). 
In particular, any code may be represented as a set of polynomials. More- 
over, the polynomial corresponding to (@n—2,@n—3,°+* , Q0, @n—1) is 


n—-2 
n-i + 5 ajz*t! = za(z) — an1 (2” — 1) = za(z) (mod z” — 1). 
i=0 
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1.2 Linear codes 


An Jn, k; q] linear code is a k-dimensional subspace of GF'(q)”". This is in 
particular an (n,q*;q) code. Vector spaces can be represented in various 
ways and different representations are used in different situations. 


1.2.1 Generator and check matrices for linear codes 


Suppose that {gi,g2,---,gx} is a basis for C. Then C is the set of all 
possible linear combinations of these vectors. Let G be the k x n matrix 
whose k rows are g1, £2,°:: , gẹ. Then 


C = {xG | x € GF(q)*}. 
We call G a generator matriz for C. A natural encoding GF (q)* > GF(q)” 
is given by 
xh xG. 


If T : GF(q)* — GF(q)* is a linear invertible transformation, then TG 
is also a generator matrix. The effect is just a change of basis. 
The inner product of two vectors x,y € GF(q)” is defined by 


xy = xy = Sn 
i=1 
where y* is the transposed of y. For a linear fn, k; q], the dual code is the 
[n,n — k; q] code 
Ct = {x € GF(q)” | xc‘ = 0 for all c € C}. 
If H is a generator matrix for C+, then 
C = {x € GF(q)" |xH* = 0}, 


where H* is the transposed of H. H is known as a (parity) check matrix 
for C. Note that GH* = 0 and that any (n — k) x n matrix H of rank n — k 
such that GH* = 0 is a check matrix. 


1.2.2 The simplex codes and the Hamming codes 


Before we go on, we define two classes of codes, partly because they are 
important in their own right, partly because they are used in other con- 
structions. 
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Let Ty, be ak x gal matrix over GF (q) such that 
(i) all columns of Tẹ are non-zero, 
(ii) ifx #y are columns, then x ¥ jy for all j € GF(q). 
The matrix Ty, generates a [E asa code Sk whose non-zero 
code words all fe as q*—!. It is known as the Simplex code. The 


dual code is an Ea 7a k, 3; a| code known as the Hamming code. 


1.2.3 Equivalent and systematic linear codes 


Let Cı be an fn, k; q] code and let 


where Q is a non-singular diagonal n x n matrix and II is an n x n permu- 
tation matrix. If G is a generator matrix for C1, then GQII is a generator 
matrix for C2. 

Let G be a k x n generator matrix for some linear code C. By suitable 
row operations this can be brought into reduced echelon form. This matrix 
will generate the same code. A suitable permutation of the columns will 
give a matrix of the form (J;,|P) which generates a systematic code. Here 
Iņ is the identity matrix and P is some k x (n — k) matrix. Therefore, any 
linear code is equivalent to a systematic linear code. Since 


(Tk|P)(—P*|In—z)* = —P+ P=0, 


H = (—P'|I,-~) is a check matrix for C. 


1.2.4 New linear codes from old 
Extending a linear code 


If C is a linear code, then C'™ is also linear. Moreover, if H is a check 
matrix for C, then a check matrix for C®* (where C is extended by b) is 


H 0t 
b 1j’ 
In particular, in the binary case, if bı bg ee bn 1, we have 


extended the code with a parity check. The code (GF(2)”)™* is known as 
the single parity check code or just the parity check code. 
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Shortening a linear code 


Shortening a linear code gives a new linear code. If G = (J;,|P) generates a 
systematic linear code and the code is shortened by the first position, then 
a generator matrix for the shortened code is obtained by removing the first 
row and the first column of G. 


Puncturing a linear code 


Consider puncturing an [n, k, d; q] code C. If the position punctured is not 
in the support of C, then CP is an [n — 1,k,d;q] code. If the position 
punctured is in the support of C, then CP is an [n — 1,k — 1,d’;q] code. 
If d > 1, then d’ = d or d’ = d— 1. If d = 1, then d’ can be arbitrary 
large. For example, if C is the [n,2,1;2] code generated by (1,0,0,...,0) 
and (1,1,1,...,1), and we puncture the first position, the resulting code is 
a [n —1,1,n — 1; 2] code. 


The *-operation for linear codes 


Let C be an [n,k; q] code over GF(q). Let C* denote the [n+ Čik; q 
code obtained from C by extending each code word in C by a distinct 
code word from the simplex code S;. We remark that the construction is 
not unique since there are many ways to choose the code words from Sx. 
However, for error detection they are equally good (we will return to this 
later). 

We also consider iterations of the *-operation. We define C7™* by 


Co = C, 
orth = (om) 


Product codes 


Let Cı be an [n1, ki, di; q] code and C2 an [n2, ke, d2;q] code. The product 
code is the [ny ne, kik, did; q] code C whose code words are usually written 
as an nı X Ng array; C is the set of all such arrays where all rows belong to 
Cı and all columns to Cp. 


Tensor product codes 


Let Cı be an [n1, ki; q] code with parity check matrix 


(1) 
Ay = (hi; )i<i<mi—in1<j<m 
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and Cy an [n2, k2;q] code with parity check matrix 
H = (hE). <; , 
2 ij J1<iK<n2—k2,1Lj<n2: 


The tensor product code is the [n1n2, n1k2+nek1 — kık2; q] code with parity 
matrix H = (hij) which is the tensor product of Hı and Ho, that is 


— p] pi2] 
ħi, (n2—k2)+i2,jin2+j2 = en hiz jz 


Repeated codes 


Let C be an (n, M; q) code and let r be a positive integer. The r times 
repeated code, C” is the code 


C” = {(e1|cg|---|e,) | ¢1,€2,...,¢, € C}, 


that is, the Cartesian product of r copies of C. This is an (rn, M”; q) code 
with the same minimum distance as C. 


Concatenated codes 


Codes can be concatenated in various ways. One such construction that 
has been proposed for a combined error correction and detection is the 
following. 

Let Cı be an [N, K;q] code and Co an fn, k; q] code, where N = mk for 
some integer m. The encoding is done as follows: K information symbols 
are encoded into N symbols using code C1. These N = mk are split into 
m blocks with k symbols in each block. Then each block is encoded into n 
symbols using code C2. The concatenated code is an [mn, K; q] code. If Gi 
and G2 are generator matrices for C1 and C2 respectively, then a generator 
matrix for the combined code is the following. 


Ga 0- 0 
0 G2- 0 
Gif 
OO" Gk Ge 


The construction above can be generalized in various ways. One gener- 
alization that is used in several practical systems combines a convolutional 
code for error correction and a block code (e.g. an CRC code) for error 
detection. 
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1.2.5 Cyclic linear and shortened cyclic linear codes 


Many important codes are cyclic linear codes or shortened cyclic linear 
codes. One reason that cyclic codes are used is that they have more alge- 
braic structure than linear codes in general, and this structure can be used 
both in the analysis of the codes and in the design of efficient encoders 
and decoders for error correction. For example, the roots of the polyno- 
mial g(z), given by the theorem below, give information on the minimum 
distance of the code. Hamming codes is one class of cyclic codes and short- 
ened Hamming codes and their cosets are used in several standards for data 
transmission where error detection is important. This is our main reason 
for introducing them in this text. 


Theorem 1.1. Let C be a cyclic [n,k;q| code. Then there exists a monic 
polynomial g(z) of degree n — k such that 


C = {v(z)g9(z) | deg(u(z)) < k}. 


Proof. Let g(z) be the monic polynomial in C of smallest positive degree, 
say degree m. Then z*g(z) € C for 0 < i < n-m. Let a(z) be any non-zero 
polynomial in C, of degree s, say; m < s < n. Then there exist elements 
Cs—m;Cs—m—1;'** , Co E GF (q) such that 


ra= y ar 
i=0 


has degree less than m (this can easily be shown by induction on s). Since 
C is a linear code, r(z) € C. Moreover, there exists a c € GF (q) such that 
cr(z) is monic, and the minimality of the degree of g(z) implies that r(z) 
is identically zero. Hence a(z) = v(z)g(z) where v(z) = 0329" iz’. In 
particular, the set 


{9(z), zg(2) = 277" g(z)} 
of n — m polynomials is a basis for C and son — m = k, that is 





k=n-m. 











The polynomial g(z) is called the generator polynomial of C. 


If g(1) Æ 0, then the code generated by (z—1)g(z) is an [n+ 1, k; q] code. 
It is the code C®™ obtained from C extending using the vector 1 = (11 --- 1), 
that is 


n 
{laraz „an, =X ai) | (a1, d2,°°° Qn) = c}. 
i=1 
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Encoding using a cyclic code is usually done in one of two ways. Let 
V = (Uk—1, Uk—2,°°* , V0) E GF(q)* be the information to be encoded. The 
first, and direct way of encoding, is to encode into v(z)g(z). On the other 
hand, the code is systematic, but this encoding is not. The other way of 
encoding is to encode v into the polynomial in C ”closest” to z”~"v(z). 
More precisely, there is a unique a(z) of degree less than k such that 


—r(z) = 2" *u(z) — a(z)g(2) 
has degree less than n — k, and we encode into 
a(z)g(z) = z”-*v(z) + r(z). 
The corresponding code word has the form (v|r), where r € GF(q)"~*. 


Theorem 1.2. Let C be a cyclic [n,k;q| code with generator polynomial 
g(z). Then g(z) divides z” — 1, that is, there exists a monic polynomial 
h(z) of degree k such that 


g(z)h(z) =z” -1. 


Moreover, the polynomial 


is the generator polynomial of C+. 


Proof. There exist unique polynomials h(z) and r(z) such that 
z” —1 = g(z)h(z)+ r(z) 


and deg(r(z)) < n— k. In particular r(z) = h(z)g(z) (mod z” — 1) and so 
r(z) € C. The minimality of the degree of g(z) implies that r(z) = 0. 
Let g(z) = ay giz’ and h(z) = yy hizt. Then 


-lif 1=0, 
X gmihi=4 Oif 0<1<n, 
i lif l=n. 


Further, h(z) = —go Weg he—iz’. Since —goho = 1, A(z) is monic. Let 
y= (0,0,--- 0, On—ks + , 9o), u= (0,0,:-- 0, ho, +> she), 
and let v, ul be the vectors l times cyclicly shifted, that is 


uw! = (hk-i+1, he—142, °° hx, 0,- ey, , 0, ho, hi,’ aa het), 
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and v! similarly. First, we see that 


k 


v-u = X gett =0 
i=0 


for —k < L < n — k. Hence, 
v”.ul =v u” =0 


fr 0 < m < k and 0 < l < n — k; that is, each basis vector for C is 
orthogonal to each basis vector in the code C generated by h(z), and so 
C=C 














The polynomial g(z) of degree m is called primitive if the least posi- 
tive n such that g(z) divides z” — 1 is n = (q™ — 1)/(q — 1). The cyclic 


E i q”—1 


code C generated by a primitive g(z) of degree m is aigi a 


Hamming code. 


The code obtained by shortening the cyclic [n, k; q] code C m times is 
the [n — m, k — m; q| code 


{v(z)g(z) | deg(v(z)) < k'} (1.1) 


where k’ = k— m. Note that (1.1) defines an [n — k + k’,k’;q] code for 
all k’ > 0, not only for k’ < k. These codes are also known as cyclic 
redundancy-check (CRC) codes. The dual of an [n — k + k', k’; q] code C 
generated by g(z) = yey giz’ where gn-k = 1 can be described as a 
systematic code as follows: The information sequence (an-k—1,...,a&o) is 
encoded into the sequence (@n—~—-1+k/,---,@o) where 

n—k-1 


aj = J Gidj—-nt+k+i- 
i=0 


This follows from the fact that 
os 
(an k—1+k'; n-k 24+k',--- a0) (0,0,...,0, gn-k;---,90,0,...,0) = 0 





pa ae 

for 0 < i < k’ by definition and that (0,0,...,0,gn—z,---;90,0,..-,0) 
where 0 <i < k’ is a basis for C. 

A number of binary CRC codes have been selected as international 

standards for error detection in various contexts. We will return to a more 


detailed discussion of these and other binary CRC codes in Section 3.5. 
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1.3 Distance distribution of codes 


1.3.1 Definition of distance distribution 
Let C be an (n, M; q) code. Let 


Ai = Ai(C) = =H Gey) [x,y € C and du(x,y) =i}. 


The sequence Ag, A1,- , An is known as the distance distribution of C and 
Ac(z) = > Aiz’ 
i=0 


is the distance distribution function of C. 

We will give a couple of alternative expressions for the distance dis- 
tribution function that will be useful in the study of the probability of 
undetected error for error detecting codes. 


1.3.2 The MacWilliams transform 


Let C be an (n, M; q) code. The Mac Williams transform of Ac(z) is defined 
by 


1 =z 
AG(z) = —(1 + (q — 1)z)"Ac | ———— }. 1.2 
tO = F0 +a- aao (=) (1.2) 
Clearly, AG(z) is a polynomial in z and we denote the coefficients of Ad(z) 
by A} = A}(C), that is, 


Ag(z) = 5 At 2’. 
i=0 


In particular, At = 1. 

The reason we use the notation A&(z) is that if C is a linear code, 
then A(z) = Aci (z) as we will show below (Theorem 1.14). However, 
AG(z) is sometimes useful even if C is not linear. The least ¢ > 0 such that 
A;(C) £0 is known as the dual distance d+ (C). 

Substituting 7 TG for z in the definition of AG(z) we get the follow- 
ing inverse relation. 








Lemma 1.1. 
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Differentiating the polynomial (1.3) s times and putting z = 1 we get 
the following relations which are known as the Pless identities. 


Theorem 1.3. Let C be an (n,M;q) code and s > 0. Then 


DA (3) = MS apoa- ge (ar 


ie 
j=0 a 


In particular, if s < d+, then 


2a. 


From (1.2) we similarly get the following relation. 


Theorem 1.4. Let C be an (n, M;q) code and s > 0. Then 


Sah) = 87 eames (07)) 


In particular, if s < d, then 


pQ- e) 


Two important relations are the following. 





Theorem 1.5. Let C be an (n, M;q) code over Z, and let ¢ = e?" VT, 
Then 


AOs L et 


ucz? ecc 
wyp (u)=i 





for0<i<n. 


Note that ¢7 = 1, but (? Æ 1 for 0 < j < q. Before we prove Theorem 
1.5, we first give two lemmas. 


Lemma 1.2. Letv € Z,. Then 


Uv ,.WH(U) __ 1+(q-l1)a if v =0, 
ds ={i78 ifv £0. 


uEZa 
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Proof. We have 


pyre wuH( rarteon 


uE Za 


If v = 0, the sum is clearly 1 + ae —1). If v #0, then 


q—1 


u=1 

















Lemma 1.3. Let v € Z,. Then 


ye Coe te OE Ga, 
uczy 


Proof. From the previous lemma we get 


> CEY gUn (u) 


uezo 
= X gui eH (u1) X Cu202 pwn (u2) ra X Cun in pW (u) 
u1CZq u2€Zq UnEZq 





(= a)" (1+ (q= Dayne. 











We can now prove Theorem 1.5. 


Proof. Since dy(c,c’) = wH(e —c’), Lemma 1.3 gives 


n 


Ate = x > Ail = a) + (a - 1a)" 


i=0 i=0 


=p Dba + q- Yay sno 


cEC c EC 


z = SI E ree gua 


ceC c'EC uczy 
1 , 
Sar D a as 
ucZy cec c/EC 
Observing that 


Sea er Se, 


cEC c/EC cec 














the theorem follows. 
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When the alphabet is GF (q), there is a similar expression for A}(C). 
Let q = p”, where p is a prime. The trace function from GF(q) to GF(p) 
is defined by 


T= ; 
= 5 a”. 
i=0 
One can show that Tr(a) € GF (p) for all a € GF (q), and that Tr(a + b) = 
Tr(a) + Tr(b). 


Theorem 1.6. Let q =p” where p is a prime. Let C be an (n, M;q) code 
over GF(q) and let ¢ = e2"V—1/”, Then 


AŁ(C y Feee a 


u€GF(q)” cec 
wy (ust 





for0<i<n. 
The proof is similar to the proof of Theorem 1.5. 


Corollary 1.1. Let C be an (n, M; q) code (over Z, or over GF(q)). Then 
At (C) >0 for0<i<n. 


1.3.3 Binomial moment 


We have 


n 


Ajx) = X Aji (a +1— x) 4 
j=l 


j=l 


= Day @ Vat — 2)r-i! 
-50-0 a(t7)). aa) 


1—3 





The binomial moment is defined by 


-Zsofi i) 
n—i 
forl<i<n. 
The relation (1.4) then can be expressed as follows: 
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Theorem 1.7. Let C be an (n,M;q) code. Then 


Ac(x) =1+ D A (OJ (1 — x). 


We note that the A; can be expressed in terms of the Aj. 


Theorem 1.8. 


forl<ic<n. 


Proof. 


17 


= = = 
E E 
Sopor 
e A 
- $ o(a 
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We can also express A? in terms of the A7. We have 


Ac(x) -—1= = Ar ~a)i(1+(q—1)x)"4 -1 





-ŻY A-a D Cog eo be 





-2 z A; (1-2)? 5 al Z ’) qit'(1—2)"-9-* 


N 
d j= i=0 


Hence we get the following result. 


Theorem 1.9. Let C be an (n, M;q) code. Then, fori <i<n, 


womi ("7 a- (1) 


= P (Mé —1) + Mq” 3 Je 


i 
jad 
From the definition and Theorem 1.9 we get the following corollary. 


Corollary 1.2. Let C be an (n, M;q) code with minimum distance d and 
dual distance d+. Then 


A3(C) =0 fori <i<d-1, 


AS (C) > max{ 0, (") (Mq" — 1)} ford<i<n-dt, 


and 
n 


&0o=(; 


Jai -1) forn-dt<i<n. 
i 


There is an alternative expression for A?(C’) which is more complicated, 
but quite useful. 
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For each set Æ C {1,2,...,n}, define an equivalence relation ~g on C 
by x ~g y if and only if x(x, y) C E (that is, x; = y; for alli ¢ E). Let 
the set of equivalence classes be denoted Xg. If two vectors differ in at 
least one position outside E, then they are not equivalent. Therefore, the 
number of equivalence classes, that is, the size of Xp, is q’~7#*. 


Theorem 1.10. Let C be an (n,M;q) code. Then, for1<i<n, 


AOS E E wW- 


BC{1,2,...n} UEXp 
#E=j 


Proof. We count the number of elements in the set 
V = {(E,x,y) | EC {1,2,... n}, #E = j,x,y E C,X ŻY, X ~E Y} 

in two ways. On one hand, for given E and an equivalence class U € Xg, 
the pair (x,y) can be chosen in #U(#U — 1) different ways. Hence, the 
the number of elements of V is given by 

#4V= J. J) #U(#U-1). (1.5) 

Bc{12,-.n} UEXE 

On the other hand, for a given pair (x,y) of code words at distance i < j, 
E must contain the i elements in the support x(x, y) and j — i of the n— i 
elements outside the support. Hence, E can be chosen in (Gee: 
a pair (x,y) of code words at distance i can be chosen in MA;(C) ways, 
we get 


) ways. Since 


#V =J MAC) Ce j = MAS(C). (1.6) 


Theorem 1.10 follows by combining (1.5) and (1.6). 














From Theorem 1.10 we can derive a lower bound on A§(C) which is 
sharper than (or sometimes equal to) the bound in Corollary 1.2. 
First we need a simple lemma. 


Lemma 1.4. Let mı, m2,..., my be non-negative integers with sum M. 
Then 
M M M \2 
2 Qlyl +) Dealy]  @ 
am PANA NIAT Ly (1:7) 
M M 
a =) a i 
+ N N (1.8) 
with equality if and only if 
iv] ss [R] 
Nee ae 


for alli. 
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Bate oe s N : 
Proof. Let x1,%2,...,x%y be non-negative integers for which )7;_, x? is 


minimal. Without loss of generality, we may assume that 7, < £i < £N 
for all i. Suppose zy > zı +2. Let yı = z1 +1, yn = TN -— l, Yi = Ti 
otherwise. Then, by the minimality of > x?, 


o< y- Sat (x1 +1)? — x? + (zyn — 1)? — 23, = 2(x1 — ey + 1), 


contradicting the assumption xy > x, +2. Therefore, we must have 
£N = T1 + 1l or gy = tı. 


Let a = |M/N| and M = Na + where 0 < 8 < N. Then 8 of the 2; 
must have value a + 1 and the remaining N — p have value a and so 


N 
Xa? = Ba +1)? + (N - B)o? = (2a + 1)8 + Na. 


This proves (1.7). We have 


[=] = ait 8 =0, and [a] =atiits>o. 











Hence (1.8) follows by rewriting (1.7). 





Using Lemma 1.4, with the lower bound in the version (1.8), we see that 
the inner sum ` yex, #U(#U — 1) in Theorem 1.7 is lower bounded by 


P 1) > (5 - 1) (2M—-a[—)), 


independent of Æ. For E there are (5) possible choices. Hence, we get the 
following bound. 





Theorem 1.11. Let C be an (n, M;q) code. Then, fori <j <n, 
432 (5) (gel -)@- Seles). 


1.3.4 Distance distribution of complementary codes 











There is a close connection between the distance distributions of a code 
and its (set) complement. More general, there is a connection between 
the distance distributions of two disjoint codes whose union is a distance 
invariant code. 
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An (n, M; q) code is called distance invariant if 
Der Aa) 
yer 
for all x € C. In particular, any linear code is distance invariant. However, 


a code may be distance invariant without being linear. 


Example 1.1. A simple example of a non-linear distance invariant code is 
the code 


{(1000), (0100), (0010), (0001)}. 


Theorem 1.12. Let the (n, Mı;q) code Cı and the (n, Mo;q) code C2 be 
disjoint codes such that C1 U Ca is distance invariant. Then, 


Mi { Acucs (z) - Ac, (2)} = Maf Acyucs (z) — Ac, (2). 


Proof. Since C1 U Ch is distance invariant, we have 


MiAcue,(2)= >> SO 2 


xEC 1 YEC1UC2 


= 5 5y zmay) 4 5 y zdu 


xE€C1 yEC1 xE€Ci = 
d 
= MiAo,(2) + DPD) emer 
xE€C1 yEC2 
and so 
d 
Mi { Acuca(? ) — Ao, (z b= y DE uy), 
xEC1 yEC2 
Similarly, 


Maf Acyuca(2) — Ac, (2 jts So 2a, 


xEC 1 yEC2 











and the theorem follows. 





If Cp = Ci, then the conditions of Theorem 1.12 are satisfied. Since 
Cı UC2 = Fj we have Mz = q” — Mı and Ac,uc, (2) = (1 + (q — 1)z)”. 
Hence we get the following corollary. 


Corollary 1.3. Let C be an (n,M;q) code. Then 


1+(q-1)z)”. 
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From Corollary 1.3 we immediately get the following corollary. 


Corollary 1.4. Let C be an (n, M;q) code. Then, for0<i<n, we have 


A@) = AC) + SF a- 


Using Corollary 1.4 we get the following. 


Corollary 1.5. Let C be an (n, M;q) code. Then, for1<i<n, we have 


= M q’—2M (n\, ,; 
A} (C) = ao Ei) Lye 


Proof. We have 

















j=l 

Mcgi z) q -2M < en 
= —— y5 A (C Ahaa Vt. )(q@- 2) 
T-M al Ce pom à n—ijJ\j ( ) 
— M om -2M [A (i\, a 
= e+ HY (Ge 

M S q” —2M (n ; 
= °(C) 4 E ajy, 

i + SSE (ae) 











1.4 Weight distribution of linear codes 


1.4.1 Weight distribution 


Let 
AY = AF (C) = #{x € C | w(x) = i}. 
The sequence Aj’, AY,--- , AW is known as the weight distribution of C and 
A&(2) = 5° AY z! 
i=0 


is the weight distribution function of C. 
We note that dy(x,y) = wu(x—y). If C is linear, then x— y € C when 
x,y € C. Hence we get the following useful result. 


Theorem 1.13. For a linear code C we have A;(C) = A¥(C) for alli and 
Ac(z) = AG(z). 
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If C and C” are equivalent codes, then clearly A;(C) = A;(C’). In 
particular, for the study of the weight distribution of linear codes we may 
therefore without loss of generality assume that the code is systematic if 
we so wish. 


1.4.2 Weight distribution of *-extended codes 


The -operation for linear codes was defined on page 8. The code Sj is a 


constant weight code, that is, all non-zero code words have the same weight, 
namely g*-?. 
Therefore, Ac«(z) only depends on Ac(z). In fact 
Ac+(z) -1= 2% (Ac(z) -1) 
since each non-zero vector is extended by a part of weight g*~!. 


1.4.3 Mac Williams’s theorem 
The following theorem is known as Mac Williams’s theorem. 


Theorem 1.14. Let C be a linear |n, k;q] code. Then 
AL(C) = Ai(C*). 
Equivalently, 
1 l-z 
A = — (1 —1)z)"Ac | ————_ }. 
cl) = 0 +a- 1)2)"0 (E) 

Proof. We prove this for q a prime, using Theorem 1.5. The proof for 
general prime power q is similar, using Theorem 1.6. First we show that 

M if u € C~ 
bD a i R (1.9) 
ere 0 ifugC-. 
If u € C+, then u- c = 0 and ¢¥*¢ = 1 for all c € C, and the result follows. 


If u g C+, then there exists a code word c’ € C such that u-c’ Æ 0 and 
hence ¢%¢’ Æ 1. Because of the linearity, c + c’ runs through C when c 


does. Hence 
5 Gee = >D cutete) = aus 5 gee, 


cEC cEC cEC 
Hence Jeeg C19 = 0. This proves (1.9). By Theorem 1.5, 


sO E 
ce 


2 1 
~ M 
ucz? 


wp (u)=t w py (u)=i 
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Corollary 1.6. Let C be a linear |n, k;q] code. Then 
d(C) = d(C+). 


1.4.4 A generalized weight distribution 


Many generalizations of the weight distribution have been studied. One 
that is particularly important for error detection is the following. 

Let C be an [n, k] code and m a divisor of n. Let A; iz,- ‚im (C) be the 
number of vectors (x1|X2| +++ |Xm) € C such that each part x; € GF(q)"/™ 
and wy(x,;) = i; for j = 1,2,--- ,m. Further, let 


Aolenza 12m) = D, Ainin sim (O)e E 2a 
ii, i2, stm 
For m = 1 we get the usual weight distribution function. Theorem 1.14 
generalizes as follows. 


Theorem 1.15. Let C be a linear [n,k;q] code. Then 


Ac (21, 22,°°* 52m "H (1+ (q — 1)z;) Aca (zis Z3 Zin) 


where 
ges Ee 
I~ Tey 
1.4.5 Linear codes over larger fields 


There is an alternative expression for the weight distribution function that 
is useful for some applications. Let G be a k x n generator matrix over 
GF(q). Let mg : GF(q)* — N = {0,1,2,...}, the column count function, 
be defined such that G contains exactly m(x) = me@(x) columns equal to 
x for all x € GF(q)*. We use the following further notations: 


lale = Mizo (9° =e): 
s(U, m) = yey M(x) for all U C GF(q)*, 
Sx is the set of | dimensional subspaces of GF(q)*, 


on(m, 2) = Dyes, 20, where Ü = GF(q)* \U, 
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Û = {y € GF(q")* |y -x =0 for x € GF(q)* if and only if x € U}, 
C, = {yG | y € GF(q’)*}, the code generated by G over GF(q’), 


C=C. 


Theorem 1.16. For r > 1 we have 
k 


Ac,.(z) = X [rle—-voei(m, z). 


1=0 
Proof. First we note that if y € U, then 


wu(yG) = 5 m(x)wg(y +x) = 5 m(x) = s(U, m). 


xEGF(q)F xEU 
Hence 
k k 
2 eres ye yi 
l=0 UESx1 yeu l=0 UESk1 yeU 
Since 


Ss 1=f r]k-i, 


ycÛ 











the theorem follows. 





For r = 1, we get the following alternative expression for the weight 
distribution of C. 


Corollary 1.7. We have 


Ac(z)=1+ © 


UESk,k—1 
1.4.6 Weight distribution of cosets 


Theorem 1.17. Let C be an [n,k;q| code and S a proper coset of C. Let 
D be the |n, k + 1;q] code containing C and S. Then 


A¥(2) = ——{ An(z) - Acta). 
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Proof. For each non-zero a € GF (q), aS = {ax | x € S} is also a proper 
coset of C and Ai’,(z) = AŞ (2). Further D = CUU, 49 aS (disjoint union) 
and so 

Ap(z) = Ac(z) + (q — 1)AS(2), (1.10) 


and the theorem follows. 














Using the MacWilliams identity we get the following alternative expres- 
sion. 


Corollary 1.8. Let C be an [n,k;q] code and S a proper coset of C. Let 
D be the [n,k + 1;q] code containing C and S. Then 


Theorem 1.18. Let C be an [n,k;q] code and S a proper coset of C. Then 
Aaa) > 2" Age) (1.11) 
for all z € [0,1]. 
Proof. We may assume without loss of generality that the code C is 
systematic. There exists a v € S such that S = v + C and such that 
v = (0|b) where b € GF(q)"~*. 
Let (x|x’) € C where x € GF(q)* and x’ € GF(q)"~*. Then 
wu ((x|x’) + (O|b)) = w(x) + wn(x' + b) 
< wu(x) +n—-—k 
< wu((x|x’)) +n- k 





and so 


zvu lix’) +O]b)) > pnk wa ( (lx), 











Summing over all (x|x’) € C, the theorem follows. 





Corollary 1.9. Let C be an |n, k;q] code and D an [n,k + 1;q] code con- 
taining C. Then 


Ap(z) > {1 “qe 1)2”=* } Ao(2). 


Proof. Let S C D be a proper coset of C. By (1.10) and Theorem 1.18 
we have 


Ap(z) = Ac(z) + (4 — 1)A3 (2) > Ac(2) + (q — 1)z""*Ac(z). 
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Theorem 1.19. Let C be an [n,k;q] code and S a proper coset of C. Then 
1 yeh 


Ag(z) < Tra- tc’) 


for all z € [0,1], where y = (1 — z)/(1 + (q — 1)z). 


Theorem 1.20. Let C be an [n,k;q| code and D an [n,k +1; q] code which 
contains C. Then 


L q1) k+1 
Ao(z) > 1+- Dy 4 
q 


for all z € [0,1], where y = (1 — z)/(1 + (q — 1)2). 


p(z) 


Proof. By Corollary 1.9 we get 
Ages a(t aS 1)2) Ac+(y) 
serite- ts) (b+ len) Ao 
=q" (1+ (q - Dy) A(z) 
=q (1 E Ly) (Acl) + (q- 1)A3(2)) 


and the theorems follow. 























Corollary 1.10. If C is an |n, k;q] code and k < n, then 


(1+ (q— 1)z)” 


Ac(z) > a 


for all z € [0,1], where y = (1 — z)/(1 + (q — 1)z). 











Proof. The corollary follows from Theorem 1.20 by induction on k. 





1.4.7 Counting vectors in a sphere 


The sphere S;(x) of radius t around a vector x € GF(q)” is the set of 
vectors within Hamming distance t of x, that is 


Si(x) = {y € GF(q)” | du(x,y) < t}. 


Let N¿(i, j) be the number of vectors of weight j in a sphere of radius t 
around a vector of weight i. 


28 Codes for Error Detection 


Theorem 1.21. We have 


t min(| = |,n—-e) 


mid= D D (z ame e-a 


e=|i—j| ô=max(i,j)—e 








where 8 =e—i+ô, y=e—j+ôð, e=i+j-—e-—2ð. 
Proof. Let wy(x) = i and let y € S(x) such that wg (y) = j. Let 


a = #{l | zı = yı = 0}, 

B= #{l| zı =0,y # 0}, 

y= #{1 | zı # 0, yı = OF, (1.12) 
ô = #{1 | zı =y # OF, 

c= #{l| 2, 40,y 40,21 4 yi}. 





Then 
i= wu(x) =y+d+e, 
j=wnly) =6+6+., 
e= dy(x,y) = 8+7+e, a) 
n=at+6+y+6+e. 
Hence 
B=e-i+4, 
y=e-jto, (1.14) 
e=i+j—e-—206. 
Further, 
Ji-jl Se Xt, 
d=i-e+f>i-e, 
d=j-e+y>j-e, (1.15) 


d6=n-e-ax<n-e, 





26=i+j-—e-e<it+j-e. 
On the other hand, if e and 6 are integers such that (1.15) is satisfied, then 


there are 
a ‘| 
("5 ') saa D8 2 


ways to choose y such that (1.12)—(1.14) are satisfied. 














For q = 2, the terms in the sum for N;(i,7) are 0 unless € = 0. We get 
the following simpler expression in this case: 


[| 


NGA= DY tee) Gs (1.16) 


y=max(0,i— j) 
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1.4.8 Bounds on the number of code words of a given weight 


Some useful upper bounds on A; for a linear code are given by the next 
theorem. 


Theorem 1.22. Let C be a linear [n,k,d = 2t+1;q] code. If Ni(i,7) > 0, 
then 

(3) 
M(t, j) 


In particular, ford < i < |z] we have 


(7) 

















A; < i > fy < ` ~ 1+, 
ran (4-1) l (4-1) 
and, for |5| Si<n-t, 


Proof. Counting all vectors of weight j and Hamming distance at most 
t from a code word of weight i we get 
Ai Ni (i,j) < (") (q-1). 
In particular, N:(i, i — t) = (‘) > 0 for all i > d, and so 
a (q-1)** = rata ayes 
Similarly, Ni(i,i +t) = "7^ (4—1) >0 for d<i<n-—tandso 


(nay), 
A; Qo = p a: 
<a y fy” 


Theorem 1.23. For an |n, k;q] code C we have An < (q—1)*. 





Ai < 


S 

















Proof. Since equivalent codes have the same weight distribution, we may 
assume without loss of generality that the code is systematic, that is, it is 
generated by a matrix 

81 

G = (Ik|P) = 

Sk 
where I, is the k x k identity matrix, P is a k x (n — k) matrix, and 
21,---,8x are the rows of G. Ifc = ae aig; has weight n, then in 
particular a; = c; £ 0 for 1 < i < k. Hence there are at most (¢—1)* such 
c. 
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There are many codes for which we have An = (q—1)*. For example, 
this is the case for any code that has a generator matrix where all the 
columns have weight one. 


1.5 The weight hierarchy 


For a linear [n, k; gq] code C and any r, where 1 < r < k, the r-th minimum 
support weight is defined by 


dr = d,(C) = min{ #x(D) | D is an |n, r; q| subcode of ch. 





In particular, the minimum distance of C is dı. The weight hierarchy of 
C is the set {d1,d2,--- ,d,x}. The weight hierarchy satisfies the following 
inequality: 





—1 
d, > dr (1+ + ) (1.17) 
q—4 
In particular, we have 
d, > deg +1. (1.18) 
An upper bound that follows from (1.18) is the generalized Singleton bound 
dr <<n—k-+r. (1.19) 


1.6 Principles of error detection 


1.6.1 Pure detection 


Consider what happens when a code word x from an (n, M) code C is 
transmitted over a channel K and errors occur during transmission. If the 
received vector y is not a code word we immediately realize that something 
has gone wrong during transmission, we detect that errors have occurred. 
However, it may happen that the combination of errors is such that the 
received vector y is also a code word. In this case we have no way to tell 
that the received code word is not the sent code word. Therefore, we have 
an undetectable error. We let Pae = Pue(C, K) denote the probability that 
this happens. It is called the probability of undetected error. If P(x) is the 
probability that x was sent and P(y|x) is the probability that y is received, 
given that x was sent, then 
Pre(C,K) = $ P(x) DY) Pi»). 


xEC yeC\{x} 
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In most cases we will assume that each code word is equally likely to be 
sent, that is, P(x) = Under a assumption we get 


FE Plt. 


M yeC\{x} 
The quantity P,.(C, K) is a main parameter for describing how well C 
performs on the channel K, and it is the main subject of study in this 
book. In Chapter 2, we study Pue(C, K) for the q-ary symmetric channel, 
in Chapter 3 we describe results that are particular for the binary symmetric 
channel, in Chapter 4 we study other channels. 


o 
PaclC, 


Remark 1.1. It is easy to show that for any channel K with additive noise 
and any coset S of a linear code C we have Pae(C, K) = Pue(S, K). 


1.6.2 Combined correction and detection 


In some applications we prefer to use some of the power of a code to correct 
errors and the remaining power to detect errors. Suppose that C is an 
(n, M; q) code capable of correcting all error patterns with to or less errors 
that can occur on the channel and suppose that we use the code to correct 
all error patterns with t errors or less, where t < to. Let M:(x) be the set 
of all vectors y such that dy(x,y) < t and such that y can be received 
when x is sent over the channel. For two distinct x1,x2 € C, the sets 
Mi (x1), Mi(x2) are disjoint. If y € M:(x) is received, we decode into x. If 
y Z M(x) for all x € C, then we detect an error. 

Suppose that x is sent and y is received. There are then three possibil- 
ities: 
(1) y € M(x). We then decode, correctly, into x. 
(2) y g M(x’) for all x’ € C. We then detect an error. 


(3) y € M(x’) for some x’ € C \ {x}. We then decode erroneously into x’, 
and we have an undetectable error. 


Let PË = P&C, K) denote the probability that we have an undetectable 
error. As above we get 
PY(C,K)=S~ P(x) Y SS PO». 
xEC x'EC\{x} yEM:(x’) 


Assuming that or = x’ € C, we get 


wo E DY Pow 


Mee x/EC\{x} ye M(x’) 





) = yy fo 
(C.K) = 
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Comments and references 


Most of this material can be found in most text books on error- 
correcting codes, see the general bibliography. However, many of the 
books restrict themselves to binary codes. 

Again, this is mainly standard material. 

Some of this material is standard. Most textbooks restrict their pre- 
sentation to linear codes and, therefore, to the weight distribution. 
Theorem 1.3 is due to Pless (1963). 

Theorems 1.5 and 1.6 are due to Delsarte (1972). 

Binomial moments seems to have been used for the first time by 
MacWilliams (1963). Possibly the first application to error detection 
is by Klove (1984d). A survey on binomial moments was given by 
Dodunekova (2003b). 

Theorem 1.9 and Corollary 1.2 were given in Kløve and Korzhik (1995, 
pp. 51-52) in the binary case. For general q, they were given by 
Dodunekova (2003b). 

Theorems 1.10 and 1.11 is due to AbdelGhaffar (1997). 

Theorem 1.12 is essentially due to AbdelGhaffar (2004). Corollary 1.3 
(for q = 2) was first given by Fu, Klove, and Wei (2003), with a different 
proof. 

Theorem 1.14 is due to MacWilliams (1963). Theorem 1.15 (for q = 2) 
was given by Kasami, Fujiwara, and Lin (1986). 

Theorem 1.16 is from Klove (1992). 

Theorem 1.17 and Corollary 1.8 are due to Assmus and Mattson (1978). 
Theorem 1.18 is essentially due to Ancheta (1981). 

Theorem 1.19 with g = 2 is due to Sullivan (1967). An alternative proof 
and generalization to general q was given by Redinbo (1973). Further 
results are given in Kldve (1993), Klove (1994b), Kløve (1996c). 

We remark that the weight distribution of cosets can be useful in the 
wire-tap channel area, see Wyner (1975) and Korzhik and Yakovlev 
(1992). 

Theorem 1.21 is essentially due to MacWilliams (1963). In the present 
form it was given in Klgve (1984a). 

Theorem 1.22 was given in Kløve and Korzhik (1995, Section 2.2). 
Special cases were given implicitly in Korzhik and Fink (1975) and 
Kasami, Kldve, and Lin (1983). 

Theorem 1.23 is due to Kldve (1996a). 

The weight hierarchy (under a different name) was first studied by 
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Helleseth, Kløve, and Mykkeltveit (1977). The r-th minimum sup- 
port weight is also known as r-th generalized Hamming weight, see Wei 
(1991). The inequality (1.17) was shown by Helleseth, Klove, and Ytre- 
hus (1992) (for q = 2) and Helleseth, Klove, and Ytrehus (1993) (for 
general q). 

1.6. A more detailed discussion of combined error detection and correction 
is found for example in Kløve (1984a). 


This page intentionally left blank 


Chapter 2 


Error detecting codes for the q-ary 
symmetric channel 


The q-ary symmetric channel (qSC) is central in many applications and we 
will therefore give a fairly complete account of the known results. Results 
that are valid for all q are given in this chapter. A special, important case 
is the binary case (q = 2). Results that are particular to the binary case 
will be given in the next chapter. 


2.1 Basic formulas and bounds 


2.1.1 The q-ary symmetric channel 


The gary symmetric channel (qSC) with error probability parameter p is 
defined by the transition probabilities 


1-—pifb=a 
P(bla) = i i 


The parameter p is known as the symbol error probability. 


2.1.2 Probability of undetected error 


Suppose x € Fọ is sent over the q-ary symmetric channel with symbol er- 
ror probability p, that errors are independent, and that y received. Since 
exactly dy(x,y) symbols have been changed during transmission, the re- 
maining n — dy(x,y) symbols are unchanged, and we get 


n—dy (x,y) 


p o 7 p 


Pø =(5 


Assume that C is a code over Fy of length n and that the code words are 
equally likely to be chosen for transmission over qSC. For this situation, 
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we will use the notation Pye(C,p) = Pue(C,qSC) for the probability of 
undetected error. It is the main subject of study in this chapter. 
If Ac(z) denotes re distance distribution function of C, then 


PAGD 1 S S 2)” (x,y) 


M OTET 
-o =p 

pr Yale (ooo) 

= (1 See — i}. 


We state this basic result as a theorem. 


1— n—dy (x,y) 
7I (1-p) 


II 


Theorem 2.1. Let o be an (n, M; q) code. Then 


BiGy el = 3 2 (x,y) 


Mo yeC\{x} 


-Yao 0-9 


1— n—dy (x,y) 
Fa (1 —p) 


-a-r [aola 
Eo e 

An (n,M;q) code C is called optimal (error detecting) for p if 
Pae(C,p) < Puae(C’,p) for all (n, M; q) codes C’. Similarly, an [n, k; q] 
code is called an optimal linear code for p if Pue(C,p) < Pue(C’,p) for all 
În, k; q] codes C”. Note that a linear code may be an optimal linear without 
being optimal over all codes. However, to simplify the language, we talk 
about optimal codes, meaning optimal in the general sense if the code is 
non-linear and optimal among linear codes if the code is linear. 

When we want to find an (n, M;q) or [n,k;q] code for error detection 
in some application, the best choice is an optimal code for p. There are 
two problems. First, we may not know p, and a code optimal for p’ Æ p 
may not be optimal for p. Moreover, even if we know p, there is in general 
no method to find an optimal code, except exhaustive search, and this is 
in most cases not feasible. Therefore, it is useful to have some criterion by 
which we can judge the usefulness of a given code for error detection. 

We note that Pye (c, 1) = aa It used to be believed that since 
p= a is the ” worst case”, it would be true that Pye(C,p) < M =! for all 








Error detecting codes for the q-ary symmetric channel 37 


p € (0, =). However, this is not the case as shown by the following simple 
example. 


Example 2.1. Let C = {(a, 0,0) | a,b € Fy}. It is easy to see that for each 
code word c € C there are 2(q — 1) code words in C at distance one and 
(q — 1)? code words at distance 2. Hence 
Ay = 1, Ai =2(q- 1), A = (q - 1)’, 
and 
Pa(C, p) = 4 2 Ea _ 21,2 
ue(C, p) =E a Y +(q-1) as (1—p) = 2p(1—p)"+p*(1—p). 


This function takes it maximum in (0, i) for p=1—- Ft In particular, 


-1 
= Pa (c, =) 


g-1 
q8 


1 
Pa(C,1- ae 


2 
= —— 0.3849 > 
z) 3v3 





for all q > 2. 


In fact, Pae(C, p) may have more than one local maximum in the interval 


[0, (q — 1)/q]. 


Example 2.2. 
Let C be the (13,21;2) code given in Table 2.1. 


Table 2.1 Code in Example 2.2. 


(1111111111110) (1111000000000) (1100110000000) (1100001100000) 
(1100000011000) (1100000000110) (0011110000000) (0011001100000) 
(0011000011000) (0011000000110) (0000111100000) (0000110011000) 
(0000110000110) (0000001111000) (0000001100110) 
(1010101011101) (0101011001000) (1010101010101) 
(1010010110011) (1001100101011) (0101100110101) 


The distance distribution of C is given in Table 2.2. 


Table 2.2 Distance distribution for the code in Example 2.2. 
i de®. #27 = 23 4 5 6 7 8 9 10 11 12 I3 


210A; 1 0 0 52 10 9 68 67 #1 2 0 0 0 


The probability of undetected error for this code has three local maxima 
in the interval [0, 1/2], namely for p = 0.0872, p = 0.383, and p = 0.407. 
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An (n, M; q) code C is called good (for error detection) if 





—1 M-1 
Pae(C, p) < Pue (c, 1—) = Ti (2.1) 
q q 
for all p € [0, 11. Note that ”good” is a technical and relative term. 


An extreme case is the code Fẹ which cannot detect any errors. Since 
Pue( Fy, p) = 0 for all p, the code is ” good” in the sense defined above even 
if it cannot detect any errors! 

An engineering rule of thumb is that if a code, with acceptable param- 
eters (length and size), is good in the sense just defined, then it is good 
enough for most practical applications. It has not been proved that there 
exist good (n, M; q) codes for all n and M < q”, but numerical evidence 
indicates that this may be the case. 

We shall later show that a number of well known classes of codes are 
good. On the other hand, many codes are not good. Therefore, it is 
important to have methods to decide if a code is good or not. 

A code which is not good is called bad, that is, a code C is bad 
if Pae(C,p) > vo for some p € [0, =]. If C satisfy the condition 
Pae(C,p) < a for all p € (0, T]; we call it satisfactory. Clearly, ” satisfac- 
tory” is a weaker condition than ” good”. A code that is not satisfactory is 

1 


called ugly, that is, a code C is ugly if Pae(C, p) > a for some p € [0, T]. 


Some authors use the term good for codes which are called satisfactory here. 





The bound a in the definition of a satisfactory code is to some extent 
arbitrary. For most practical applications, any bound of the same order of 
magnitude would do. Let C be an infinite class of codes. We say that C is 
asymptotically good if there exists a constant c such that 

Pae(C, p) < cPae (0, —) 
for all C € C and all p € [o, 11 . Otherwise we say that C is asymptotically 
bad. 

A code C is called proper if Pue(C,p) is monotonously increasing on 
(0, |. A proper code is clearly good, but a code may be good without 
being proper. 

A simple, but useful observation is the following lemma. 


Lemma 2.1. Fori < j and p € [o, a] , we have 
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Proof. We note that (2.2) is equivalent to 
a) 2 =. 
Gaia p) © Aa- I- p) 
and this is satisfied since 


P 
oaa 














When we want to compare the probability of undetected error for two 
codes, the following lemma is sometimes useful. 


Lemma 2.2. Let £1, £2,... £n and V1, Y2,- -.-, Yn be real numbers such that 


LI> T2- > Xy>0 





and 


j 
XO y0 for j=1,2,...,n 


i=l 


Then 
i=1 


Proof. Let oj = 71+72+:-::+7;. In particular, co = 0 and by assump- 
tion, øj > 0 for all j. Then 


3 Viti = Be Oi — OF-1 \a, = A Oiti — > Oifi+1 


g= 


= outa + Sle Zi — Li41) > 0. 














Corollary 2.1. If C and C’ are (n,M;q) codes such that 


AAC) < DA (C") 


for all j =1,2,...,n, then 


for all p € [0, (q — 1)/q]. 
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Proof. The results follows from Lemma 2.2 choosing y; = Ai(C”)— Ai(C) 


and x; = (4) (1 — p)"~*. Lemma 2.1 shows that the first condition in 








Lemma 2.2 is satisfied; the second condition is satisfied by assumption. 





Example 2.3. We consider the possible distance distributions of (5, 4; 2) 
codes. There are 38 different distance distributions of (5,4;2) codes; of 
these 10 occur for linear codes. It turns out that 2A; is always an integer. 
Therefore, we list those values in two tables, Table 2.3 for weight distribu- 
tions of linear codes (in some cases there exist non-linear codes also with 
these distance distributions) and Tables 2.4 for distance distributions which 
occur only for non-linear codes. 





Table 2.3 Possible weight distributions for linear [5, 2; 2] codes. 


2A1 2A2 2A3 244 2A5 type? no. of nonlinear no. of linear 
0 0 4 2 0 P 0 3 
0 2 0 4 0 P 0 2 
0 2 2 0 2 P 0 2 
0 2 4 0 0 P 12 6 
0 4 0 2 0 P 24 3 
0 6 0 0 0 S 4 2 
2 0 0 2 2 G 0 1 
2 0 2 2 0 G 0 4 
2 2 2 0 0 S 24 6 
4 2 0 0 0 U 0 2 


aP: proper, G: good, but not proper, S: satisfactory, but bad, U: ugly. 


We note that if C is a (5,4;2) code, and we define C’ by taking the 
cyclic shift of each code word, that is 


C” = {(c5, €1, C2, €3, €4) | (€1, C2, C3, C4, C5) € CH, 


then C’ and C have the same distance distribution. Moreover, the five 
codes obtained by repeating this cycling process are all distinct. Hence, 
the codes appear in groups of five equivalent code. In the table, we have 
listed the number of such groups of codes with a given weight distribution 
(under the headings ”no. of nonlinear” and ”no. of linear”). 

Using Corollary 2.1, it is easy to see that Pue(C,p) < Pue(C’,p) for 
all (5,4;2) codes C and all p € [0,1/2], where C” is the linear [5, 2; 2] 
code with weight distribution (1,2,1,0,0,0). A slightly more complicated 
argument shows that Pue(C,p) > Pue(C”, p) for all (5, 4; 2) codes C and all 
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Table 2.4 The other distance distributions for (5, 4; 2) codes. 


N 
= 
N 
D 
N 
N 
> 
wo 
N 
> 
a 


2A5 type? no. of nonlinear 


24 
24 
12 
24 
48 
32 
24 
24 
24 
24 
16 
24 
72 
48 
24 
12 
60 
48 


WWNNNNNNNFPRPRPREEP EPR RE EE OOOCOCOOCOSO 
WNWNFEF RF ROWNNNNFRPREOUNWWWNNNEE 
CREPE NEF RFORP NWNHNHFWNNHFWOWNOWWNAW 
SCOOHENHENNDOHOHHENENNHOOWHOHE DN 
SCO OCOOORFFOOCORFOORHFOCORCORFOSO 
BGAMANNMNNNNQMDNDUVUVUVUVIIVIVUNHRHVYVVIUVUU 


aP: proper, G: good, but not proper, 
S: satisfactory, but bad, U: ugly. 


p € [0,1/2], where C” is the linear [5,2;2] code with weight distribution 
(1,0,0,2,1,0). 


For a practical application we may know that p < po for some fixed po. 
If we use an (n, M,d;q) code with d > pon, then the next theorem shows 
that Pae(p) < Pue(po) for all p < po. 


Theorem 2.2. Let C be an (n,M,d;q) code. Then Pue(C,p) is 


monotonously increasing on [o. z] . 


Proof. Since p'(1 — p)"~* is monotonously increasing on [o, il, and in 
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In 


particular on [o 4] for all i > d, the theorem follows. 


2.1.3 The threshold 


Many codes are not good for error detection (in the technical sense). On 
the other hand, p is usually small in most practical applications and (2.1) 
may well be satisfied for the actual values of p. Therefore, we consider the 
threshold of C, which is defined by 


6(C) = max f» € [o, a) | P,a(C,p) < Pua(C. 1—1) for all p € o.v} i 
q q 


(2.3) 
For p < 6(C) the bound (2.1) is still valid. In particular, C is good for error 
detection if and only if 0(C) = (q — 1)/q. Note that 0(C') is a root of the 
equation Pya(C,p) = Pua(C,(¢q — 1)/q), and it is the smallest root in the 
interval (0, (q — 1)/q], except in the rare cases when Pya(C,p) happens to 
have a local maximum for this smallest root. To determine the threshold 
exactly is difficult in most cases and therefore it is useful to have estimates. 


Theorem 2.3. Let w(6;q) be the least positive root of the equation 


(Jawa. 


q—1 q 
If C is an (n, M,d; q) code, then 0(C) > y(d/n;q). 


Proof. For p<w=w(d/n;q) we have, by Lemma 2.1, 
= Z . p p — pni 
Pua(C, p) SAT (1—p) 


<(2.)*a-y* 4 
i=d 


q-1 
- (A 0-9) -1 
< (Ha-ya 


= (M ~1) Pal, 1). 


II 





Hence 6(C) > a. 
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Example 2.4. For m > 1, let Cm be the binary code generated by the 
matrix 

1...10...00...0 

0...01...10...0 

0...00...01...1 


Clearly, Cm is a [3m,3,m; 2] code and 

Pue(Cm,p) = 3p™(1 — p)?™ + 3p°™ (1 — p)” + p°”. 
For m < 3, Cm is proper. The code C4 is good, but not proper. For the 
codes Cm, d/n = 1/3. For m > 5, we have 


Pae(Cm,1/3) _ 3(4/27)™ _ 3 732\m 
P,e(Cm,1/2) = 7/8" salo) a 


and the code Cm is bad. 


We have 
Gave) 


and so (1/3; 2) ~ 0.190983. Hence 0(Cm) > 0.190983. On the other hand, 
if om is the least positive root of 30™(1 — 0)?” = 7-273”, that is 


eae 


a(l — 0)? = (= 


2 ? 
then 


1 
Pae(C, 0) = 30™ le)" 430" (1 — o)” tor >7.279™ = Pae (c, 5) 


and so 0(Cm) < am. Since (3/7)'/™ — 1 when m — ov, we see that 
Om — W(1/3,2). In Table 2.5 we give the numerical values in some cases. 
The values illustrate that om is a good approximation to 0(Cm). 


Table 2.5 Selected values for Example 2.4. 


5 0.3092 0.3253 
6 0.27011 0.27055 
10 0.22869306 0.22869335 


30 0.201829421660768430283 0.201829421660768430299 
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2.1.4 Alternative expressions for the probability of unde- 
tected error 


There are several alternative expressions for Pae(C, p). Using Lemma 1.1, 
we get 


Aol -— DG zp) 7 ae -p "aa(t 7 a) 


and so Theorem 2.1 implies the following result. 
Theorem 2.4. Let C be an (n, M;q) code. Then 
M qp f 
Pue(C,p) = —A&(1 = -2 ) =(= p) 
q q—-1 
In particular, if C is a linear |n, k; q] code, then 


Pae(C, p) = g0” Acı (1 = -2) -(1- p)” 
Ta 


av qp_\* 3 
=a DC aa) =p) 


Example 2.5. As an illustration, we show that Hamming codes are proper. 


The [n = ak = Care — ma] Hamming code C is the dual of the 











q-1 
weight g™—1! = (n(q—1)+1)/q and q™—1 = n(q— 1). Since g'&-" = q>™ = 
1/(n(q— 1) + 1), we get 


q 
kS „mM; q| Simplex code Sm. All the non-zero code words in Sm have 


n(a—1)+1 


1 q a 
PAC =—_—_(1 -1(1-—* 9) )-a- n, 
(C, p) mE +n(q- 1) TEU (=p) 
Hence for p € (o, 11 we get 
d n(q—1)+1 4 
— Pye(C,p) = -n(1 — — p) +n(1—p)?-? 
dp q-1 
(n-1)(a~1) j (m-D4-1) 
=n (a-pe) © -(1-—p) * }>0 
q—1 
since 


(1 —p)/@-D > (1 _ aa"): 


Therefore, C is proper. 
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We go on to give one more useful expression for Pue(C, p). 


Theorem 2.5. Let C be an (n, M;q) code. Then 
ProlC, Ag(C (sy -) (1 = ae 
p) -5 TE 


Proof. Combining Theorems 1.7 and 2.1, we get 
— =z, n ae = 


= LAO GS na =) 6 {g= na =) 


-Sro 














2.1.5 Relations to coset weight distributions 


We here also mention another result that has some applications. The result 
follows directly from Theorem 1.19. 


Theorem 2.6. Let C be an [n,k;q| code and S a proper coset of C. Let 0 
be sent over qSC, and let y be the received vector. Then 


Priyes) _ py AY (Gat) tl = 2) 
Priy€C) (1 —p)"4o(qatsy) 


2.2 Pye for a code and its MacWilliams transform 
Let C be an (n, M;q) code. Define PŁ (C, p) by 


Z p ' n—i 
=Saro(—) a-p. 
i=1 g= 
If C is linear, then Theorem 1.14 implies that P4(C,p) = Pae(C+, p). 
Similarly to Theorem 2.4 we get 
1 qp 
PŁ(C,p) = —A (-)-a- n, 24 
feat p) M C q 1 ( p) ( ) 
Theorem 2.7. Let C be an (n, M;q) code. Then 


-1- M 
Pae(C, p) = M(1 - p)"PL(C, aoa) + y (1- p)” 
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and 

PEC) = DU- p)" Re(0 —®) + (1 = 
Proof. From (2.4) we get 
Pae(C,p) = (1 SA Gaya) z 1} 


=(1-py"{4c(1- = 1—2) 1} 


q—1 q-qp 


=(1-p)"{ mpi (c, ——*) + u(1-—*)"-1} 
q= qp q= qp 


=T= 
= M(1 - p)" P(c, —$—* 


) M 
q — qp 


+—-(1-p)". 
q 











The proof of the other relation is similar. 





The dual of a proper linear code may not be proper, or even good, as 
shown by the next example. 


Example 2.6. Consider the code C; defined in Example 2.4. It was shown 
in that example that Cs is bad. On the other hand, 


1 
Pye(Of Pp) = {1+ 3(1 — 2p) +301- 2p)" + (1 — 2)" } - 0- p)”, 
and this is increasing on [0, 1/2]. Hence C is proper, but the dual is bad. 


There are a couple of similar conditions, however, such that if Pue(C, p) 
satisfies the condition, then so does P(C, p). 


Theorem 2.8. Let C be an (n, M; q) code. 


q 


if and only if Pīe(C,p) < 1/M for all p € [o, a]. 


(2) Pae(C,p) < Mq™{1— (1p) } for alt p € [0, =], 

if and only if PE(C, p) < 1/M{1 — (1 — p)"=* } for all p € [0,4]. 
(3) Pal, p) < HEH 1- (1 p)”} for alt pe [0, =], 

if and only if P(C, p) < ET {1 - (1 = p)” } for all p € [0, I. 


(1) Pac(C,p) < Mq~” for all p € lo, s], 
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Proof. Assume that Pue(C,p) < Mq” for all p € lo, s]. Using The- 
orem 2.7 we see that if Pue(C,p) < Mq-”, then 


n Es 1 
PŁ(C+,p) = a - p)” Pa( C, ow) tae (1-p)” 


q— qp M 
q” M 1 1 
< (1 -p — + — - (1 —- p)” = —. 
S p) mT: (1 — p) i 











The proof of the other relations are similar. 





Note that for a linear code C, the relation (1) is equivalent to the state- 
ment that C is satisfactory if and only if C+ is satisfactory. 


2.3 Conditions for a code to be satisfactory, good, or proper 


2.3.1 How to determine if a polynomial has a zero 


In a number of cases we want to know if a polynomial has a zero in a given 
interval. For example, if HPuelC, p) > 0 for all p € (o, 11), then C is 


proper. If Pye Q 1) — P,e(C,p) > 0 then C is good (by definition). If 
Ac, (z) — Ac,(z) > 0 for z € (0,1), then Pye(C1,p) > Pue(C2,p) for all 
pE (o, st), and so C% is better for error detection than C1. The simplest 
way is to use some mathematical software to draw the graph and inspect 
it and/or calculate the roots. However, this may sometimes be misleading. 
Therefore, we give a short description of the systematic method of Sturm 
sequences. 


Let f(z) be a polynomial and |a, }] an interval. 
Define the sequence fo(z), fi(z),:-: , fm(z) by the Euclidean algorithm: 


folz) n f(z), 

fil) = F'(2), 

Fi(z) = —fi-2(z) (mod fi_i(z)), deg(fi(z)) < deg(fi-1(z)) for 2 < i < m, 
fim(z) divides fm_—i(z). 


Define go(z), g1(2),:** , gm(2) by 
fil) 
gi(z) = =. 
2) T 
If go(a) = 0 we divide all g;(z) with (z — a); and if go(b) = 0 we divide all 
gi(z) with (z — b). 
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Lemma 2.3. The number of distinct zeros for f(z) in (a,b) is given by 
#{i | g-1(a)gi(a) <0, 1 < i < m} — Hi | Gir (0) gi(b) < 0, 1 <i < m}. 


Another method which is not guaranteed to work, but is simpler when 
it does, is to rewrite f(z) into the form 


fe =Y ul- ab- 2) 


M EM 


where n = deg(f(z)). If a; > 0 for all i, then clearly f(z) > 0 for all 
z € [a,b]. For example, this is the method behind Theorem 2.5. 


Example 2.7. As a further example, we show that the [2™,2™ — m — 1,4] 





extended Hamming code is proper. For convenience we write M = 2™71, 
Then 
1 
Palp) = Tyo {1 + (4M = 2)(1 = 2p)™ + (1 = 2p)™ } — (1 = p)™, 


and so 
L Pulp) = 2M(1 apy" — (2M — 1)(1 — op)" 7! — (1 — 2p)?°™-! 
Ip 
= 2M(p+1-—2p)?™-" 
(2M — 1)(2p +1 — 2p)” (1 — 2p) t — (1 — 2p)?! 
2M—1 





=2M > = = p(l _ apa 
M 
=0 
2M— 
Sy wal = 2p) ee 1-7 
1=0 


where ao = Q, = Q2 = 0, and 
ila; = 2M(2M — 1)(2M — 2){T] emu —)-JJem—25+ 2)} >0 
j=3 j=3 
for 3 < i < 2M — 1. Hence £ Pac(p) > 0 for all p € [o, 4); that is, C is 
proper. 
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2.3.2 Sufficient conditions for a code to be good 


Theorem 2.5 may be useful to show that a code is good or proper. First, 


Pa(C#4) = MENG Baa otp)" 








q” gol ge 
_M-1 “(n if p \* q n—i 
-ZSE = he) 
eo q-1 q—1 
Therefore, if 
(M—-1 
ap < LAD (8) (2.5) 


for 1 <i < n, then C is good. We can now use the results in Corollary 1.7. 
For 1 < i < d we have A? = 0 and so (2.5) is satisfied. For 0 < n — i < dt 
we have A? = (Mq’-" — 1)(") and so 


q (we 1) (") - ae a(t -am (5) >0 


and (2.5) is again satisfied. Therefore we get the following theorem. 


Theorem 2.9. Let C be an (n,M,d;q) code with dual distance d+. If 
A? < M-1) n 
t — qi 4 
ford<i<n-—d-, then C is good. 


2.3.3 Necessary conditions for a code to be good or satis- 
factory 


We will give some necessary conditions for a code to be good. 


Theorem 2.10. Let C be a good (n, M;q) code. Then, for alli, O<i<n 
we have 


M-1 n”(q-1' __M-!1 
AG Fin — Art pOr 





Proof. Choosing p = i we get 


M2 a Aa 


2 a(z ()""= aa 
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This proves the inequality. The equality follows directly from the definition 
of the g-ary entropy function H,(z) since 
— z K 4 —z 
qe) = (—) (1—z)**. 
q-1 
Theorem 2.11. Let C be a satisfactory (n,M;q) code. Then, for all i, 
0<i<n we have 














Ae E ba 
q” i(n —i)r-* 
aa eea E 


M ülni) i 
Proof. The proof of the first inequality is similar to the proof of Theorem 
2.10. For the second inequality, we first note that 


Mq™” > Pael(C, p) 


= j 
= Mq” +Mq-” a(i- r) —(1— p)” 
q q 2 j pai?) ~G-») 





=>Mq"+ Mq"At (1 = r) =(= p)” 
q— 
and so 


q yc C n 
a areart Sigea 


(n=) (4-1) 

aay) ae 

"\(n-i)(q-1/J T M\n-i/ ` 

Theorem 2.12. Let C be a good (n,M;q) code. Then 
MA? < (q—1)n. 


Choosing 1 — —— p = 


J we get 

















Proof. From Pae(C, p) = Mq” Y; o AL(1 — 4p) — (1 — p)” we get 


d 7 es qi q i—1 
L Pel p = n0 -pt -My Aa i 
7 (C, p) =n(1— p) q D A S) 


Since Pae(C, p) cannot be decreasing at p = a for a good code, we get 
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A class of necessary conditions can be obtained by considering a 
weighted average of Pae(C, p) over some interval. We first formulate this as 
a general theorem, and thereafter we specialize it in two ways. 


Theorem 2.13. Let C be a good (n, M; q) code. Let 
O<a<b<(q-1)/q, 


and let f(p) be a continuous non-negative function on [a,b] such that 
f? f(p)dp = 1 (that is, a probability distribution). Then 


M — 
go 





b 
/ f(p)Pae(C, p)dp < 


Proof. Since C is good, Pue(C,p) < E for all p € [a,b], and so 





M — 


b b 
f f(D) Pae(C, p)dp < i, f(y) — 
a a q 


Wet f? M1 
= J flier = 2 
q je q 


1 
dp 























To compute the integral i f(p)Pae(C, p)dp we use one of the explicit 
expressions we have for Pye(C,p). Let us first consider a constant f(p), 
that is f(p) = z for all p. For this case, it is most convenient to use 
Theorem 2.4, that is, 


n 


i=0 q-1 


This gives 











e Ga) | 
-yoe -at (1-0). 


This gives the following corollary. 
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Corollary 2.2. Let C be a good (n, M; q) code and let 
0<a<b<(q-1)/q. 











Then 
= nm At a \itl i+1 
i EO 0-4)" -0- A) 
goer li = a)” t! = a = pyre 
< M-1 
qn 


For the next special case, we let a = 0 and b = (q—1)/b and for f(p) we 
use a so-called beta function. More precisely, let a and 8 be non-negative 
integers. Then (see e.g. Dweight (1961)) 


“adopta a 
het-a Gui 





Hence 


(q-1)/q p a qp B E q— 1 
(Ea) (0 e ee. 
0 q-1 q—1 (a+ B+ 1)gett(*3.) 
Therefore, a possible choice for f(p) is 


o (a+ B+ 1qrt (04) y p ye T NP 
(Oger ea) Ogee) 
For the application of this weighting, the most useful form of Pae(C, p) is 
the one given in Theorem 2.5: 


ren- Soastey Pa) (1 2%)" 


q—1 
This gives 


(q-1)/q 
if t(p)Pao(C, pdp 


WERE DPE TY 
ga dy" ane 
(a+ B+ DEH CD) S gore 


q-1 


(q-1)/q p \tta qp \n-i+68 
Gi aa 2 
q-1 q-1 


q—1 
i=1 EETA 1)q peor (ALOT) 
Ag(C) _(*2°) 


tae). 














(a+ 6+4+1) 
~ (n+a+ß+1)4 





i M 
I 
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Since 
AEC). 0. A 
(nta+B+ij(rterP) (Prater) 


we get the following corollary. 


Corollary 2.3. Let C be a good (n,M;q) code and let a and B be non- 
negative integers. Then 


57 AO) Con) ~M-1 
ee. ae 














i=1 

For a linear [n, k, d; q] code, a general lower bound on Ag is g—1, and for 
a non-linear (n, M, d; q) code a general lower bound on Ag is 2/M. Now, let 
A be some positive number. We will consider (n, M,d;q) codes for which 
Aq > A. In the rest of the subsection we also use the notations 








Q= x i and « = In(M/A) = ln M — hn A. 
q— 
By definition, 
1 A 
Pic(Cp) > a a OA 
Hence, if 
A d n 
£a- Qn)" > (1—2)", (2.6) 


then PŁ(C,p) > 77. Taking logarithms in (2.6), we get the equivalent 
condition 
—«K+dln(1 — Qp) > nln(1 — p). 
Combining this with Theorem 2.8, 1), we get the following lemma. 


Lemma 2.4. If C is an (n,M,d)q code and 


dln(1 — Qp) -Kk 
n > h(p) = D H 
then C is ugly. 


Any choice of p, 0 < p < (q — 1)/q now gives a proof of the existence of 
a u(d, K) such that if n > u(d,«) and C is an (n, M,d) code with Aa > A, 
then C is ugly for error detection. The strongest result is obtained for the p 
that minimizes h(p). We cannot find a closed formula for this, but consider 
approximations. 
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We will use the notations 


f(p) = ein and g(p) = ae 


Then 


h(p) = d f (p) + s g(p). (2.7) 


The function f(p) is increasing on (0, (q— 1)/q), it approaches the value 
Q when p —> 0+, and it approaches infinity when p — (q—1)/q—. Moreover, 


_ Q0 = p)InG = p) + A= Qp) al = Qp) 


f'(p) (1 —p)(1 — Op) hnll- py? 
and 
" a= pe ee eN at 
f") = —(1 — p2(1 — Qp)2(In(1 — p))3’ 
where 


filp) = Q?(1 — p)*(In(1 — p)? + 2Q(1 — p)(1 — Qp) (1 — p) 
—2(1 — Qp} In(1 — Qp) — (1 — Qp)? In(1 — p) n(1 — Qp) 
>0 





for all p € (0, (q — 1)/q). Hence f is convex on (0,(q — 1)/q). Similarly, 
the function g(p) is decreasing on (0, (q — 1)/q), it approaches infinity when 
p — 0+, and it takes the value —1/Ing for p = (q — 1)/q. Moreover, 


(1—p)In(1—p)?  (L—p)(1— Qp) In(1 — p)?’ 


—(2+In(1 — p)) 
(1 -— p)? (n(1 — p))’ 


for all p € (0, (q — 1)/q), and so g(p) is also convex on (0, (q — 1)/q). This 
implies that the combined function h(p) is also convex on (0, (q— 1)/q) since 
k > 0, and it takes its minimum somewhere in (0, (q—1)/q). We denote the 
value of p where minimum is obtained by pm and the minimum by p(d, K). 

From Lemma 2.4 we get the following necessary condition for a code to 
be good. 


>0 


go" (p) = 


Corollary 2.4. If C is good for error detection, then n < u(d, K). 
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One can find good approximations for u(d, K) when d >> k or k > d. Here 
we give some details for d > k. Let x = ad, where a is a parameter, 
O0O<a<1. Then 





np) = aCe 
and 
hi(p) _ —Q(—p) In ~ p) + (1 = Qp) nd — Qp) _ a 
d (1 — p)(1 — Qp) In(1 — p)? (1 -— p)ln(1 = p)? 


In particular h’ (p) = 0 if (and only if) 


o QU- p)a(t = p) + (1 = Qp) a(l — Qp) 
1—Qp 

We want to solve this for p in terms of a. There is no closed form of this 

solution. However, we can find good approximations. For a — 0+, we see 

that p — 0 and h(p) —> Q. We will first describe this important case in 

more detail. We note that a — 0+ implies that d — oo. The parameter «k 

may also grow, but then at a slower rate (since d/K — 0). 


"= ANOT 


There exist numbers a; and b; for i = 1,2,... such that, for any r > 0, 


. (2.8) 


Theorem 2.14. Let 


; 
Pm = > aiy' + O+), 


i=] 
and 
p(d, ad) = dQ fı +2(Q—1) J biy + owr») 
w=1 


when y > 0 (that is a — 0). The first few a; and b; are given in Table 2.6. 


Proof. First we note that a = 2Q(Q — 1)y? and so 





and 


H(p,y) 


h'(p) =" ope = Onin = pe: 
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Table 2.6 a; and b; for Theorem 2.14 








i Qi bi 

1? 2 1 

2 —(8Q +2)/3 (2Q — 1)/3 

3 (26Q? + 22Q -— 1)/9 (2Q? — 2Q — 1)/18 

4 (368Q° + 708Q? — 12Q + 8)/135 2(Q — 2)(2Q — 1)(Q + 1)/135 


where 


H(p,y) = —Q(1—p) (1 —p) + (1 — Qp) n(1 — Qp) — 2Q(Q — 1)y?(1 — Qp). 


Hence h'(p) = 0 if H(p,y) = 0. Taking the Taylor expansion of 
H(X aiy’, y) we get 








; aj —4 a 
H (Y aiy, y) = ty? + Bla? + a? + Gay + 12Q)y* ++ 


All coefficients for i < r should be zero. In particular, the coefficient of y? 
shows that a? = 4. Since a y? is the dominating term in the expression 





for p when y is small and p > 0, we must have a; > 0 and so a, = 2. 
Next the coefficient of y? shows that a2 = —(16Q + 4)/6. In general, we 
get equations in the a; which can be used to determine the a; recursively. 
Substituting the expression for p into h(p) and taking Taylor expansion, we 
get the expression for (d, K). 














Assuming that ka — 0 and taking the first three terms of approxima- 
tion, we get 
2Q-1 
K, 
3 





uld, K) = dQ + /2dkQ(Q— 1 


(the other terms goes to zero with y). 

By definition, h(p) is an upper approximation for any p. One way to 
get a good upper approximation is to choose for p a good approximation 
for pm. For example, taking the first term in the approximation for pm, 
that is, p = \/2a/(Q(Q — 1)), we get 


wd, x) < h (y2af(Q@=1)). 


By similar analysis, one can determine approximations for u(d, K) when 
k is larger than d. The main term is 


K 
d x—. 
a(d, K) ig 
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2.3.4 Sufficient conditions for a code to be proper 
An immediate consequence of Theorem 2.2 is the following theorem. 


Theorem 2.15. If C is an (n,M,d;q) code and d > in, then C is 
proper. 


A related condition is the following. 


Theorem 2.16. If C is an (n,M,d;q) code and d+ > in, then C is 
proper. 


Proof. Any code of length one is proper, so we may assume that n > 2. 
Let 


FP) = PeZ (1- “Ba -9". 


tds 
Then 
es Toad qp_\* 
rosal- ten Deg ae) 
i=dt q~ 
Hence 
i-l 
-a2 
ro) „0-A 
E es fe =e 2; 
n(1 —p)r-t Wo Wynn jyn=l ye joai ( 9) 
We note that d+ > in implies that 
—1 1 —1 
d -1> ED i n=, 


Hence, for all i > d+, 


i-1 d+-1 (n-1)(q-1)/ 
e Ea Use) 
q— 


( - 2) je Vo Dene 
pe ay Se rn 
G-p $ (= pa 5) i 


Therefore, by (1.4) 


f'(p) M a 
=>>] iA; 
n(1 -p15 ng- 1) 3 
n—1 
ay es SAG = A= An si: 


n(q= 1)! M 
Hence, f'(p) > 0 for all p € [0, (q — 1)/q] and so C is proper. 
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Remark 2.1. The bound in the theorem cannot be improved in general, 
in the sense that there exist bad (n, M, d; q) codes for which d+ = tn. 


The idea of the proof of Theorem 2.16 can be carried further. Let 
0<a < (q-— 1)/q. If h(p) = (1 — 4p) , then 


and 
2 a—2 
h (p) = a(a — (+) (1 a — p) <0. 
q— 
Hence, the equation 
q Q 
1-4») hes 
( q— 1” £ 

has a unique solution p(q, a) in the interval 0 < p < T We observe that if 


a B 
a < 6, then (1- r) <(1- +p) for all p. Hence, p(q, a) > plq, 8). 


Theorem 2.17. If C is an (n,M,d;q) code and 
d d+ —1 
— 2 (a. i 
n 


n— 1 





then C is proper. 
Proof. By (2.9), 


Pa d+—1 
F'(p) A) W Sai 
n(l—p)r* = (=p nqa- m 
d+—1 
(1 = it.) n=l 
q—1 M q 
Sipe ng Dart TE eA 


V 
= 
| 
~~ 


L 
Ta aye —1)/(n-1) 
q—1 

(1 —p) 


for 





d+=—1 q—1 
Caer ee 
n—-1 q 


Hence, f(p) is increasing in this interval. On the other hand, by Theorem 
2.2, f(p) is increasing on [0, d/n]. Combining these two results, the theorem 
follows. 
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Example 2.8. The equation for p(2,1/3) is (1 — 2p)'/3 = 1 — p, that is, 
1 — 2p = (1 — p)? which has the solution p(2, 1/3) = (3 — V5)/2 © 0.3819. 
Hence, if C is an (n, M,d;2) code such that (d+ — 1)/(n — 1) > 1/3 and 
d/n > 0.3819, then C is proper. 


Example 2.9. Consider a code C for which d > 2 and d+ = in. Let 
a= (S4n-1)/(n-1). If 
q T 2 
1—-——-) <1l-- 2.1 

( q—ln/ 7 n’ (2:20) 
then p(q,a) < 2/n and the code C is proper by Theorem 2.17. A careful 
analysis shows that (2.10) is satisfied for n > 5 when q = 2 and for n > 3 
when q > 3. 


To formulate the next sufficient condition for a code being proper, we 
define the functions 


vo- SOGDU 


Theorem 2.18. Let C be an (n, M; q) code. Then 
AS (C ZACCO A&A (C 
PulCs0) = Fn DAE - Gtr pM 


i=2 (") q (lamt 


Ne 


Proof. Since 
so- Avni) = (Ja'() (1-2). 


Theorem 2.5 implies that 








Pue(C;p) = DAO gaye ( Bile) =A) 
SAO) 5 9) So ALO 4 
2 De MO Le 








{#0 Aj_1(C) 


Oe Tete 


i-1 














The representation in Theorem 2.18 is useful because of the following 
result. 


Lemma 2.5. The functions A;(p) are increasing on [0, i) for alli > 1. 
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Proof. This is shown by straightforward calculus: 
A; = j-l nj 
d P) y "\ £5 ( w q ais qp ) 
dp =) g=1 q-1 q-1 
\ (_P_\4 qp NR yt q 
“0 (5) 0S} 





q-1 q-1 q—-1 
qu n es qp qp \"-s-1 
“AS GiG e- 
a Wea gad 
q wo(n vf ap \i qp \"-j-1 
AE (ea 
TA l ) q-1 q-1 





q (n\./ gp i qp ie 
— $= |= 
q—1 N) q-1 AY 


wi 
for all p € (0, 1>). 














Combining Theorem 2.18, Corollary 1.7, and Lemma 2.5, we get the 
following result. 





Theorem 2.19. Let C be an (n, M; q) code. If 
o A (C 
e (2.11 
(7) (21) 
ford+1<i<n-—d", then C is proper. 
Remark 2.2. The condition (2.11) can be rewritten as 
iA? (C) > q(n-—1+1)A$_1(C) (2.12) 
and also 
1 Gi 
(i-1)y 
2 “0 4,0) > gS S78 4,(0), (2.13) 
jan CO) zi "O 
where 


mj) =m(m - 1)...(m— j +1). 


2.3.5 Large codes are proper 


The code F? is proper: Pue(Fọ,p) = 1 -— (1 — p)” and this is clearly 


increasing with p. Hence, there exists a bound B(q,n) < q” such that if 
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M > B(q,n), then any (n, M,q) code is proper. We will give estimates for 
B(q,n). Let 
—(qn—n+q)+s 


B(q,n) = nna) 


b 


where 





s= y (m-n +q)? +4n(q— 1)(qn—n— q)q”. 
Lemma 2.6. If |C| < (q, n), then C is proper. 
We note that if [C| > q” — B(q,n), then 
IC| < blq, n). 


Hence from the lemma we get the following equivalent result. 
Theorem 2.20. For q > 2 and n > 3 we have 


B(q,n) <a" — Plan); 
that is, any q-ary code of size at least q” — (q, n) is proper. 


Proof. Combining Corollary 1.3 and Theorem 2.19 (with the condition 
in the form (2.13)), a sufficient condition for C to be proper is 


a (rai a" -200 (7) -D9 


i=1 
We also note ee 


2 (")a-0'=¥ h) 
(q¢—1)' = (q—1)' >q —1 
i PO 2 


for 2 < l < n. Hence, the sufficient condition for C to be proper can be 
written 


C Do tma ig- 2m (1) a- 


(3) 





“MY lo 4, + (q —2M)(q' — 1) 


w=1 Ni) 
>a A + (g OM) gg 1) (2.14) 


for2<Il <n. pe ue the term for i = l in the sum on the left-hand 
side and rearranging, we get the following stronger sufficient condition for 
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C to be proper (stronger in the sense that if (2.15) is satisfied, then (2.14) 
is also satisfied): 


I-1 
(g" -2M\(q—1) > My («- 4) Ai (2.15) 


for 2 < l< n. Since 


(l-1 (a- l ) TEE l-g, 





NG) l—i n (n — 1)G-1) 
we get 
1-1 
l-D¢ 
y "8 (a — ast (ql—1—q) ach (ql —1—q)(M —1). 
i=1 i 


(2.16) 
Combining (2.16) with (2.15), we see that a yet stronger sufficient condition 
for C to be proper is 
i 1 
(q" — 2M)(q— 1) > — (ql—1—q)M(M — 1) (2.17) 
for 2 < l < n. The strongest condition is imposed for l = n and this 
condition is 


slr 


(q” —2M)(q- 1) > 


This is equivalent to 


(qn —n — q)M(M ~ 1). 


M < B(q,n). 


This completes the proof of Lemma 2.6 and Theorem 2.20. 














Using different estimates, we can obtain a stronger bound than Theorem 
2.20; this bound is not explicit, however, but needs some computation. In 
(2.15), the terms in the sum are non-positive for q — l/(l— i) < 0, that is 
i > l(q— 1)/q. If we omit these terms, we get the stronger conditions 


Li(a—1)/4] Gia l 
(@-2M)\g-)>M Y Le («- =) A (2.18) 
i=1 (i) au 
for 2 < l < n. We note that the condition for l + 1 is stronger than the 
condition for l since 
l 
Oole- H5) _ lad+a-gi-i-) | 
(=o (a-G) C+D- a) 
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for i < I(q—1)/q. Hence, the strongest condition is the condition for | = n, 
namely 
ln(a=1)/a] 4 
(7 -2M)q-1)2M_ $) —(g(n -) — n) Ai. (2.19) 
i=1 
We have A; < () (q—1)*. We can use this bound on A;, but must take 
into account that $7, A; = M — 1. Let 


Mm = S @ (q—1). 


i=1 
Assume that M > M;,_, where r < |n(q — 1)/q], and let 





$= Slain — -m (7) (a= D+ aln- -nM = 1 - Mra). 








Then i 
pA Ln(q—1)/q} 
8 > Dani) —m)(") a= D+ alan) =n) A 
(ann) =) D> (")@—1) 
r—1 a i i [n(q-1)/a] 
=Sar-a(“)a= 18 +(an=N =n) D A 
[n(q-1)/q] r—1 
-e (a) mA Yar d((7) a- - 4) 
ina Ai/al i 
i=r+1 
[n(q-1)/a] 
> E a )- nA 


Hence, by (2.19) we get the following stronger sufficient condition for C to 
be proper: 


-2a -92 ain= = (“)(a— 1) 
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Solving this inequality, we get a maximal value which we denote by ,(q, n). 
Provided 3,.(q,n) > M,_1 (this was a requirement for the derivation above) 
we have 


Bin, q) < q” — B,(q,n). 
In particular, 31(q,n) = B(q,n). The larger r is, the larger is the corre- 
sponding 6r(q, n). Therefore, we want to choose r as large as possible. We 
know that we can use r determined by M;,_1 < B(q,n) < Mr. Usually, but 
not always, this is maximal. 


Example 2.10. We illustrate the procedure by a numerical example, 
namely q = 3 and n = 21. The values of M, and §,(3,21) are given 
in Table 2.7. Note that this is an example where r determined by 
M,-1 < B(q,n) < M, is r = 4, whereas the maximal r that can be used is 
larger, namely, r = 5. 


Table 2.7 Table for Ex- 
ample 2.10 


ro Mr-1 Br (3,21) 


1 O 106136.11 
2 42 110468.15 
3 882 115339.98 
4 11522 = 120392.84 
5 107282 121081.17 
6 758450 


A lower bound on B(q,n) is obtained by giving an explicit code which 
is not proper. Let 


4(2,n) = oL(n+3)/2] 
y(q,n) = glt for q> 3. 





Theorem 2.21. For q > 2 and n > 4 we have 
B(q,n) > q” — (q, n). 
Proof. Let C be the (n, q"; q) code 
{(xl0) € FẸ |x € FË}. 


A= ) (= 1) 


It is easy to see that 
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for all 7. Hence, 


PulC,p) = > (ra = p) - p) = (1 -— p) — (1 — p)”. 


a 
i=1 


Therefore, if f(p) = (q” — q”)Pue(C, p), Corollary 1.3 shows that 











Hence, 
Fp) == (n = k) (0 = p) + nla” = a) =p 
and so 
E = —(n— kh” + nfa” — q") <0 
if 


—q®(n—k)+n(qr-* — 1) < 0. (2.20) 


Let k = (n + a)/2 (where a = 1,2,3). Then (2.20) is equivalent to 


q=? (n — ga) —n<0. 





For a = 3 and q > 2 we have 


n— gQ 





-3 
n— q“ <n- <0 forn > 4, 
For a = 2 and q > 2 we have 
an7a gn—2 
n-q =z n=? ~z Sl fornz4. 


For a = 1 and q > 3 we have 


= =i 
n- <n-3-— <0 forn > 3. 


Hence, (=) < 0 for q = 2, k = |(n + 3)/2], and n > 4; and also for 
q>3,k = |(n+2)/2], and n > 4. 
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2.4 Results on the average probability 


2.4.1 General results on the average 


In this section we consider the average probability of undetected error for 
the codes in some set C of codes of length n. 

The common notations for the average and variance of a stochastic vari- 
able X are E(X) and Var(X) = E(X?) — E(X)?. Further o = Var(X), 
the standard deviation of X. 

For our particular application use the notations 


Pae(C, p) =- Se Pae(C,p), 
CEC 


and 


Var(C, p) = Var({Px(C,p) | C € C}). 


For SC Fy, let 
_ #{C EeC|S c C} 
#C 
If S = {x,,x2,...x,}, we write for convenience a(S) = a(x1,X2,...X,). 


From the definitions and Theorem 2.1 we get the following result. 


Theorem 2.22. Let C be a set of codes of length n and size M. Then 


H(x,y) 
Pae(C, p) = a 5 TEA y (1 — pray), 


(x.y )E(ZP)? q 
x4y 


Proof. We have 


dy (x,y) 
P(C, p) St y (— ) (1 — p) 7y) 
- cee M (ae; pee? a 1 
1 P du (x,y) n—dy(x, 
= Ga orgs 
Cec 


(x,y )E (ZR)? 
xA#y 


R A 


—1 
(@y)E(Zg)2 q 
x#y 


x,yEC 


l 
| 
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For linear codes we get a simpler expression, using Theorem 1.13. 


Theorem 2.23. Let C be a set of linear codes of length n. Then 


PoC E a(x) (2) a — pyre, 


x€GF(q) q-1 
x0 
Let 
1 
A(C) = = >> Ai(C). 
#C Cec 
Then clearly 
Z p t n—i 
PalC, p) = Y Ai(C)(—2) 0 - py. 


q—1 


From Theorems 2.22 and 2.23 we get the following corollaries. 


Corollary 2.5. Let C be a set of all codes of length n and size M. Then 


agep 2 abe. 


dy(x,y)=i 


Corollary 2.6. LetC be a set of all linear codes of length n and dimension 
k. Then 


x€GF(q)” 
wy (x) =i 


2.4.2 The variance 


For the variance we get similar results. 


Theorem 2.24. Let C be a set of codes of length n and size M. Then 


1 
Var(C,p) = —Pae(C,p)? + We 5 a(u, v, x,y) 


(av) Gaye (FP)? 
u#v,xźy 
( p m a 


yen datas aa Cay) 
q-1 


=P 
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Proof. We have 


E(Pue(p)) = a Se Pae(C, p}? 


CEC 
dy(u,v) 
os _ n—dy(u,v) 
-zby L (G) l a-y 
CEC (u, Oe 
1 p dy (x,y) d 
mS p J — p)r— daly) 
M x G = z) (Lo?) 
a 


1 
= 7p 5 a(u,v,x,y) 


(u,v), Gaye (Fp)? 
užv,xźýy 


( p e _ p)?” dn (av) dey), 


q-1 














Similarly, we get the following alternative expression for linear codes. 


Theorem 2.25. Let C be a set of linear codes of length n and dimension 
k. Then 


Var(C,p) = 
a(x, y) pur +uH(¥) (1 — p)? va (x) -wa lY) _ D(C, p)?. 


x, yEGF(q)”\{0} 


2.4.3 Average for special classes of codes 


Next, we consider the average for some special sets of codes. First, for some 
fixed (n, L; q) code K, let 


Cmy(K)={C|CCK and #C = M} 
denote the set of (n, M; q) subcodes of K. 


Theorem 2.26. Let K be an (n, L; q) code and M < L. Then 


M-1 
Pae(Cimy (E), p) = gorie p), 


Proof. First, 
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We see that if x ¢ K or y ¢ K, then a(x, y) = 0. On the other hand, if 
x,y € K and x £ y, then 


HC € Can(K) xy E= (a) 


and so 


and 


A(n (K) = 3 X a(x,y) 


Gye(Zn)? 
dy(x,y)=i 

5 M(M — 1) 
keki L(L-— 1) 
dy(x,y)=i 


M-1 1 
— _— 1 
ce oe 
(x, y)EK? 
dyy (x,y) =i 





M-1 
= =—Ai(K). (2.21) 





Hence, 


-1 
Pae(Cimy(K), p) = gorie p). 














Choosing K = GF (q)” we get the following corollary. 


Corollary 2.7. We have 


Pae(Can (GFO) p) = G1- (= v)"}. 


Remark 2.3. Corollary 2.7 implies in particular that for any given n, M, 
q, and p there exists an (n, M; q) code C such that 





Pe(C p) s Z {1-0-9} (2.22) 


It is an open question if, for given n, M, q, there exists an (n, M; q) code 
C such that (2.22) is satisfied for all p € [0, (q — 1)/q]. A code satisfying 
(2.22) for all p € [0, (q — 1)/q] is clearly good. However, it is even an open 
question if there exist good (n, M; q) codes for all q, n and M. 
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A sufficient condition for the existence of codes satisfying (2.22) is the 
following. 


Theorem 2.27. Let C be an (n, M,d; q) code with d > tn. Then (2.22) 
is satisfied for all p € [0, (q — 1)/q]. 


Proof. First we note that 


(q—1)n+1 
E : 


d>d= 


By Theorem 2.49 (which is an immediate consequence of Lemma 2.1), we 
have 





Pao(C,p) < (M -D( 2) a-p. (2.23) 
Let 
_ 5 
fe) = {1-0} -r- (2) a -p 
We see that if f(p) > 0, then by (2.23), (2.22) is satisfied. We have 
g M-—1)n àa M-i 5 mE 
f'(p) = aap) we QD” =p tin) 
=M- -PH - —ato)}. 
where 
_ p16 — np) 
We have 
, > 5p>—? = ôp?-? q— 1 
g'(p) = ia 1- (n-— 1)p) = ape ee -p) = 0. 
Hence, g(p) is increasing on [0, (q — 1)/q]. Since 
7 _ (M—-1)n (q-1 M-i n 1 
D e a a a ee 


this shows that f’(p) is first positive, then negative and so f(p) is first 
increasing, then decreasing on [0, (q — 1)/q]. Since f(0) = f((q— 1)/q) = 0, 
we can conclude that f(p) > 0 for all p € [0, (q — 1)/q]. This completes the 
proof. 
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Remark 2.4. By Theorem 2.15 we know that d > in also implies that 
C is proper. A simple example which shows that for d < “+n, C may be 
proper without (2.22) being satisfied, is the proper (5, 4;2) code 

C = {(00000), (00001), (00110), (01111)}. 
The distance distribution for this code is 1,1/2,1,1,1/2,0 and it is easy to 
show that 


spl - p) +p’ (1 -— p)? +p? -— p)? + Lya —p)> Ža — (1 —p)*) 


for all p € (0,1/2), that is, (2.22) is never satisfied for this code. 


We get similar results for the average of linear codes. For some fixed 
[n, k; q] code K, let 


Cir (A) ={C|CCK and dim(C) =k} 
denote the set of |n, k; q] subcodes of K. 
Theorem 2.28. Let K be an |n, Kk;q] code and k < K. Then 


alee 
Pae(Cigy (K), p) == qr — q Pulp). 





Proof. First, 

i _ Tig" =) 

k] awa 

the Gaussian binomial coefficient. We see that if x g K, then a(x) = 0. 
On the other hand, if x € K, then 


#Cw (K) = | 


HCE xech= |k h 











1 
and so 
kl 1 

and 

gi 

Ai (Ci (K)) = Po Ait). (2.24) 

Hence, 

q? — 

Pac(Ci (K), p) = Pue(K, p). 
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Choosing K = GF(q)” we get the following corollary. 


Corollary 2.8. We have 


q" —1 


Pue(Cuy(GF(Q)"),P) = F {1 ~ 0)" 





Comparing Corollaries 2.7 and 2.8, we see that 


Even if the averages happen to be the same, there is no reason why, for 
example, the variances should be the same. This is illustrated by the next 
example where the variances are different. 


Example 2.11. In Example 2.3 we gave a listing of the possible distance 
distributions of (5, 4; 2) codes. From these we can compute the average and 
variance. For the average we get 


PuelC(ay(GF(2)°),P) = PaE (GFO), p) = =-{1 - (1 p)°}, 


as we should by (2.25). For the variance we get 


42 O 

Var(Cia(GF(2)°),P) = gg 2 uP (1 — py", 
i=2 
10 

5 +t i — 10% 

Var(Cj2\(GF (2) Jp) = 37 MP (1 = p), 


where the u; and v; are given in Table 2.8. It turns out that 
Var(Cia)(GF(2)°),p) < Var(Cig(GF(2)5), p) for all p € (0,1/2). 
Table 2.8 The coefficients u; and v; in Example 2.11. 
i 2 3 4 5 6 T 8 9 10 


ui 369 720 1818 1800 1890 864 513 72 45 
vi 19 20 68 50 70 24 23 2 3 


2.4.4 Average for systematic codes 


Next we will consider the average probability of undetected error for some 
classes of systematic codes (the variance can be found in a similar way). Let 
SYS(n,k) be the set of systematic (n, q}; q) codes, SVSL(n,k) be the set 
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of systematic linear |n, k; q] codes and SYSL(n,k,d) the set of systematic 
linear [n, k, d; q] codes. We consider first general systematic codes. Then 
we give a more detailed analysis of some classes of systematic linear codes. 


Theorem 2.29. For1<k<n, we have 
Pue(SYS(n, k), p) = a" {1 = (1 =p). 


Proof. We can write the vectors in the form (u|v), where u € FY and 
v € Fr-*. Let C be a systematic (n, q”; q) code. By definition, two distinct 
code words of C must differ in some of the first k positions. Hence, for the 
class SYS(n,k), we get a(x, y) = 0 if x and y are identical in the first k 
positions. To get a systematic code, the last n — k positions can be chosen 


k 
q 
arbitrarily. Hence #SYS(n, k) = (Cag) . On the other hand, if x and y 


q*—2 
differ in some of the first positions, then they are contained in (ar) 
codes. Hence 


n—k q2 
(4 ) 2k—2n 
a(x, y) = ee : (2.26) 
(==) 
From Theorem 2.22, we get 
1 p dy(u,u’)+dy(v,v’) 
Pelle a ) 
(C, p) ma x a A 
(ulv), (|v) EZF) 
užu’ 


(1 spy ey 


_2n p dy(u,u’) = ui 
Sgm N ( ) p E 


—1 
(ayu/)e(ZB)2 a 
užu’ 


Pp da(v,v’) n=k= v,v’ 
2 a Coes 
(vv je (Zg 7")? í 


=q*" E {1-a-p)*} ye 


ucZk VEZNE 
= q7. dfi =ü — p)*} grok 


=a "{1-(1-p)F}. 
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Theorem 2.30. We have 
A;(SYSL(n,k,d)) =0 forl<i<d-l, 


gay Cay 


a 


Il 
© 


with equality ford = 1. 


Proof. Let i> d. A vector of the form (0|v), where v € GF(q)"~* \ {0}, 
is not contained in any systematic code. Hence the number of vectors x of 
weight ¿ which can be contained in a systematic code is 


(e-v -07 *)a-ay. 


Let x be such a vector. It remains to show that 


(2.27) 


g=- So (CT 1) 


The vector x contains at least one non-zero element in the first k positions. 
Without loss of generality, we may assume that it is a 1 in the first position, 
i.e. x = (1|z|v), where z € GF (q). Let 


( 1 z v ) 

ot Ip P 

be a generator matrix for a code in SYSL(n, k, d) which contains x. Then 
(Ik-1|P) generates a code in SYSL(n — 1, k — 1, d). Hence 


#SYSL(n — 1, k — 1,d) 


< 2.28 
oC) ST FSVSL(n, Fd) oe 
On the other hand, if G = (Ip_i|P) is the generator matrix for a code in 
SYSL(n—1,k—1,d), then any non-zero linear combination of d—1 or less 
columns in the corresponding check matrix (P'|I,—,) has a non-zero sum. 
Let y be a non-zero vector in GF(q)"~* different from all sums of d — 2 or 


less of these columns; y can be chosen in at least 


d—2 


e-i- ("7 a-n 


l=1 
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distinct ways. Each choice of y gives a check matrix (y*|P*|In-x) such that 
any combination of d — 1 or less columns has a non-zero sum, that is, a 
check matrix for a code in SYSL(n, k, d). Therefore, 


#SVSL(n, k, d) nk © fn- I 
syllu- Lk- L) | -5 ( l Je- 1 


l=0 


(2.29) 


Combining (2.28) and (2.29), we get (2.27). For d = 1 we get equality in 
both (2.28) and (2.29). 














Theorem 2.31. Let C = SYSL(n, k, d), the set of all systematic |n, k, d; q 
codes. Then 





cG e aa 
Pye(C,p) < qc“ 
grey Ce gan 
1=0 
with equality ford = 1. 
Proof. Let 
1 
B= d—2 
ga - So" )(q-D! 
1=0 
By Theorem 2.30 we get 
n n—k 
Pae(C, p) < Ay ("Ja -p-> e ‘ra — p)" } 
i=d i=d 
k ys (0 n—k i n—i 
=6{1-(1-p) ->{(") = ( ; )}o (1 —p) \. 
i=1 











From Theorems 2.29 and 2.31 we get 
Prue(SYSL(n,k), p) = Pue(SVS(n, k), p), 


that is, the average over all systematic linear codes is the same as the 
average over all systematic codes (when q is a prime power). We had a 
similar result for the set of all [n,k;q] codes and all (n,q*;q) codes. We 
also note that on average, systematic codes are better than codes in general. 
We state a somewhat stronger theorem. 
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Theorem 2.32. Let 1<k<n and p € (0,(q—1)/q). The ratio 


Pro(SYS(n,k),p) gf -(1 — p)*} 


PCF”) gafa} 
k(1—q ”) 


is increasing on (0, (q — 1)/q). For p — 0+ the ratio is ptij» and for 
p= (q — 1)/q the ratio is 1. 





Proof. Let 
f(p) = PaelSYS (n, k), p) = 1- 0- p)" }, 
0) = Pela FP) = EH - oon 
h(p) = f(p)/g(p). 














Then 
g(p)*h'(p) = f(p)g'(p) — f'(p)g(P) 
ao 
= Pi - (1 =p a — = n( a 
—n = k 1 n 
—q""k(1 — p)* roe as Cae) } 
k 
= q (1 - pF 0), 
q 
where 








We have F(0) = 0 an 
F'(p) = ah — K)(L = p)" + (n = k)n( — p)?? 
= n(n — b)(L—p)"-*-1{1 - (1 = p)*} > 0 
for all p € (0, (q — 1)/q). Hence h’(p) > 0 for all p € (0, (q — 1)/q). 














How good is the upper bound in Theorem 2.31? It is exact for d = 1. 
We will next derive the exact value of Pue(SVSL(n, k,2),p) and compare 
this with the upper bound in Theorem 2.31. 


Theorem 2.33. Let C = SYSL(n,k,2). For1<i<n we have 


so- 2a (") - (Bee - SOT 
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where r=n—k and ¢ = —1/(q" — 1), and 


Pel p) = {1 — (py! = g"( =)" + a"(0 — o), 





where 
qp 
6(p) = 1- ——. 
gq —-1 





Proof. We see that (J;,|P) is a generator matrix for a code in C if and 
only if all the rows of P are non-zero. In particular, #C = (q" — 1)*. 

For given non-zero elements a1, Q2,...Q@;, consider representations of 
v E€ GF(q)": 


V = Q1X1 + Q2X2 +--+ + QiXi (2.30) 


where x; € GF(q)” \ {0} for j = 1,2,...,i. The number of such repre- 
sentations depends on whether v is the all zero vector or not. Let (i) be 
the number of representations of 0 € GF(q)” and y(i) be the number of 
representations of v € GF (q)” \ {0}. 

We note that (2.30) is equivalent to 


V — QiXi = Q1X1 + QA2X2 ++ Qi—1Xi—1. (2.31) 


If v = 0, then the left-hand side of (2.31) is non-zero. Hence in this 
case (X,,X2,...,X;—-1) can be chosen in y(i — 1) ways for each of the q” — 1 
choices of x;. Hence 

bli) = (g — 1)y(i — 1). (2.32) 

Similarly, if v 4 0, then the left-hand side of (2.31) is non-zero except when 
a;X; = v. Hence we get 

Bi) = (q = 2)yli — 1) + (i 1). (2.33) 

The pair of equations (2.32) and (2.33) constitutes a simple linear recursion 

that can be solved by standard methods. The start of the recursion is the 


obvious values 3(1) = 0 and y(1) = 1. The result, that can easily be 
verified, is 








ti) = fg -y 1t), 
=e- (-1)} 


The number of codes in C containing a vector (u|O0) where wy(u) = i 
is 3(i)(q” — 1)*~* and the number of such vectors is (E) (q—1)'. For 0 < 
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l < i, the number of codes containing a vector (u|v) where wy(u) = l 
and wy(v) = i — l is y(1)(q” — 1)*~', and the number of such vectors is 
(60 ,)(q- 1). Hence 


M0= (gmap a -P--9} 
FOCE wore 


IEA OEO 
CITO 


we get the expression for A; given in the theorem. Putting this into the 


Since 


us 


expression Pue(C,p) = Xii Ol) (1 — p)"~* and rearranging, we 











get the expression for Pye(C,p) given in the theorem. 





The upper bound in Theorem 2.31 is exact for p = 0 and too large by the 
quantity ir for p = m. The difference between the upper bound 
and the exact value appears to increase monotonously when p increases 
from 0 to T The exact value of Pue(SYVSL(n, k, d), p) for general d is not 
known, and it is probably quite complicated both to derive and express. 


Our final example is systematic binary even weight codes. 


Theorem 2.34. LetC be the set of all systematic even weight |n, k; 2] codes. 
Then 


A:(C)=0 for oddi, 
1 n n—k , 
Ale) = sar{(") - ( i )} for even i > 0, 
and 


Puo(C,p) = sg {1 + (1— 2p)" — (1p) — (1 2p) (1 — pb. 


Proof. The proof is similar, and we only give a sketch. All the rows of P 
have to have odd weight. For a vector x = (u|v), where u Æ 0 and wy(x) 
is even, we have 





a(x) = 27079, 
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and for even i there are (") — (F such vectors of weight i. This proves 
the expression for A;(C). Further, 


aen r S14 (5) - ("a aa 


=l 








=2-"{14 (1 — 2p)” — (1 — p)? - (1 — 2p)" (1 — p)" }. 











2.5 The worst-case error probability 


In many applications we do not know the value of p, at least not exactly. 
In other applications we may want to use one code for several different 
values of p. Therefore, we will be interested in finding the worst-case error 
probability 


Pywc(C, a, b) = aac Pyue(C, p) 
for some interval [a,b] C [0,1]. First we give an upper bound. 


Theorem 2.35. Let C be an (n,M;q) code. Then 


Pye(C, a, ) < X` Aiun ala, b) (2.34) 


=r 


where Ao, Ai,...,An is the distance distribution of C and 
(4) (1—a)"-?* if + <a, 


pnslasd)=4 (aia) (1-4) fasts, 
(A) a-o yi> 


If C is a constant distance code, we have equality in (2.34). 


Proof. First we note that the function p’(1 — p)"~* is increasing on the 


n 


interval [o i] and decreasing on the interval [1 . Hence 


sass, (FT) 0-0} = msta, 
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and so 











If C is a constant distance code we get equality. 





The bound in Theorem 2.35 may be improved by subdividing the inter- 
val [a,b]: let a = ap < a1 < +++ < am = b. Then clearly 
Poel; a, b) = EA Pac (C, Qj-1, aj). 


It is easy to see that if 


n 


P= SOAs max | > (25) 0-9") 


i=l 





then 


Pwe(C, aj-1,4j) < XO Astin ilaj-1,a5) 
i=1 
< Pae(C, aj-1) + (aj — aj-1)F 
< PuclC, @j—1,4;) + (a; = aj—1)F. 
Hence we can get as sharp a bound as we want by subdividing the interval 
sufficiently. 


Remark 2.5. It is possible to find an upper bound similar to the bound 
in Theorem 2.35, but in terms of the dual distribution Aj, A+,..., A+: 

1 n 
pe 5 Aj vn ala, b) 


i=0 


Pue(C, a,b) < 





where ,;(a,6) is the maximum of the function fn (p) = (1 — ip), — 
(1 — p)” on the interval [a,b]. As in the proof above, the maximum is 
obtained in a, in b or in an inner point in (a,b). In most cases we cannot 
find explicit expressions for the maxima of fn,(p), but we can find as good 
approximations as we like. If we only want to find the worst-case of one code 
on one interval, it is probably a better idea to do this directly. However, 
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once we have computed the maxima of f,; for i = 0,1,...,n, then it 
is a simple task to compute v,,;(a,b) for any a and b, and since this is 
independent of the code, we can compute the worst-case probability of any 
code for which we know the dual distribution (of course, an alternative is 
to compute the distance distribution of the code itself using Mac Williams’s 
identity and then using Theorem 2.35. However, it may be simpler in some 
cases to use the attack sketched above). 


Next we consider a bound on the average value of Pyc(C, a,b) over some 
class C. Let 


Pwc(C, a,b) = Zz Ð Pee(C, a,b). 


cEC 


From Theorem 2.35 we immediately get the following bound. 


Theorem 2.36. Let C be a class of (n, M; q) codes. Then 
Pyc(C, a,b) < 5 Ai(C)un,i(a, b). 
i=1 


We now consider the worst-case over [0,1]. Since 


nal) = (go) 0-5) 


we get the following corollary to Theorem 2.36. 


Corollary 2.9. For a set of (n, M;q) codes C we have 


n 


recon S32 0a) (0-5) 


Some special cases of Corollary 2.9 are given in the following corollaries, 
obtained using (2.21) and Theorem 2.30. 


Corollary 2.10. We have 





where 


EH es) 
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Corollary 2.11. We have 


1 
Pwe(SYSL(n, k), 0, 1) < gn (Sn E Sn,k), 





where 


T OC ay 


Note that S, and Sn, do not depend on q. Some relations for the sums Sn 
and Sn, are the following: 


FE ged 7m 1 0.108918 
Peete < Sn < Do cae ae (2.37) 
Tm 1 V 2T 1 
Sa= 43 at ge tO) veen n> o, A 
1 (2k mmn 2 1 
Samla) VF ata) me noo ea) 
for k > 1, 
— 1 
q 
For example, 
1 
Poe(Ctyy(GF()"),0,1) = gS 


and 


Peel SVSL(n,1);0;1) = Se: 


n 


For q = 2 and the worst-case over [0, 1/2] we get 


1 iNi inni n 
w(0d)= (0-3) oe <9 
Hn.i(065 n n mr S3 


1 1 
Hn,i (0, 5) = Fn for i> = 


Since 

[n/2] a; : n 

iNi 1\n-t 1 1 

a) a 

i=l [n/2]+1 
we get 

ke 
Pue (Cu (GF(0)"),0, 5) < a yi (2.41) 
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In particular, for given n and k, there exists an [n,k;2] code C such 
that 
1 Q*—1 mn 1 0.108918 
Pel Fede) 5) ( > 2 ). 
O%5)<sa—-pj 3T va 
The following improvements have been shown. 


(2.42) 


Theorem 2.37. There is a constant y such that for given n and k, there 
exists an [n,k;2] code C such that 


1 
Pie (c, 0, 5) < QR-2 (yy Inn+1). 
For sufficiently large n, the result is true for y = 2/,/n. 


Theorem 2.38. There are constants v and qı such that if k > v(Inn)?, 
there exists an |n, k;2] code C such that 


1 
Prue (c.0, 5) < yok", 


Corollary 2.10 shows that an average code is not too bad. However, 
some codes may be very bad as shown by the next theorem. 


Theorem 2.39. For each y > 0 and each € € (0. +), for all 


1 y 
heen ca e 


there exists a code C of length n such that 
Pae(C, p) > y: Pae (c, 1) 
for all p€ e, = - €]. 
Proof. Let Cn = {(a|0) € Fẹ | a € Fy}. Then 
Pae(CnsP) = fnlp) = (4 - mer Spy soley, 


1 
and 


© fa(p) = (1 p)"2(1 — np). 
p 


Hence, if n > 4, then 1 — np < 0 for p > €, and so fn(p) is decreasing on 
le, 1]. Further, if m > 1 + ETUC then 


fn( -e 7 
A (tow > (1-4 


Example 2.12. For q = 2, N = 4 and e€ = 0.1, we can choose any n > 10. 








-«) ae(n —1)>¥%. 
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2.6 General bounds 


Define 
Pye(n, M, p; q) = min{ Pye(C, p) | C is an (n, M; q) code}, 
Pye|n, k, p; q] = min{ Pae(C, p) | C is an [n, k; gq] code}. 
In many cases one is not able to find the weight distribution. There- 


fore, we give a number of lower and upper bounds on P,.[n, k, p; q] and 
Pae(n, M, p; q) which may be used to estimate Pre. 


2.6.1 Lower bounds 
We have Ag > 2/M for an (n, M, d; q) code and Ag > q—1 for an fn, k, d; q] 
code. Hence we get the following two trivial lower bounds. 


Theorem 2.40. For any (n, M,d;q) code C and any p € [o, = we have 


Pa(C, p) > = Ca “(1 pyi, 


Theorem 2.41. For any [n,k,d;q| code C and any p € lo, 11 we have 


Pelc p) > (@-1)(2)"a ny" 


There are a couple of bounds that follow from Lemma 2.1. 


Theorem 2.42. For any n, k, and any p € [o, 1 we have 


ngk—l (a++) 
yaa E 

q-1 
Proof. Let C = {xo = 0,x1,°++ ,Xgx_1} be an [n, k; q] code and let its 
support weight be m. Let ti = w(x), the Hamming weight of the x;. 
Then 


Pacln, k, p; q] > (a = 1)( 


qh-1 


= p ý n—ti 
Pae(C, p) = 2 a) (hp: 
If j is in the support of C, then 1/q of the code words of C have a zero in 
position j, the remaining have a non-zero element. Hence 


qe-1 


5 y= mg*}. 
i=1 
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q-1 
qe-1 qh-1,. 
IES ae “a pyat—pe-Di n 
i=1 pd 
p mq”! k k-1 
= 1 — p) Dnm" 
Ge -) (1 —p) 
If y1, y2,--: , yw are real numbers such that JAA yi = c > 0, then it is 


easily seen that BAN yi > N cÑ, and the minimum is obtained when all 
the y; are equal. In particular, we get 


Pae(C, p) > (4° — (Aa Sp tas 
: p mir 

SIP aa) 
> (a = N= p(B 





for all p € [o, s]. 











Theorem 2.43. For any n, M, and any p € [o, 14 we have 


M n 
Pye(n, M, p; q) > mE (1-—p)”. 


Proof. Let C be an (n, M;q) code. Then 





M a q n M n 
Pie =—A — — — — > — — — : 
ue(C, p) n é(1 =P) (1 p) AET (1 p) 








Remark 2.6. The bound in Theorem 2.43 is negative and hence not in- 
teresting for p < 1 — 1M t/n. However, for p > T it is asymptotically 
tight. 





From Theorem 2.5 and Corollary 1.2 we get the following bound. 
Theorem 2.44. For any n, M, and any p € [o, 11 we have 


Pue(n, M, p; q) 2 > (") (4 -1)(4)'0- a)” 


=A 
i=n— |log,(M)] 1 
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Theorem 2.45. For any n, M, and any p € [o, a] we have 


p t q n—i 

uel“ »D; 2 $ 1— ) ’ 

Pye(n, M, p; q) | 5 alaa T 
i=n— |log,(M)] 


where 











a= (E -De-A 


For linear codes there is a bound of the same form as Theorem 2.43 
which follows directly from Corollary 1.10. 


Theorem 2.46. For any n, k, and any p € [o, a] we have 





Pool k pia] > = I (1+ (@-1)(1- p) ) - apy 
j=k+1 


Lemma 2.7. Let C be an (n,M;q) code, O < u < 1, and p € [o, 4). 
Then 


els 


Pae(C,p) = (M -1+ 0- (2) + =a)" } 





Pre (c, ( 1)p* + ( Da a 
Proof. Let 
cafe tee MOE a 
ie (q—1)p%+(q—-1)"(1—p)™ (2.43) 
Then 


Pu p 


(q— 1) = pu) Gane) 
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Let Xio Aiz? be the distance distribution function of C. Using the Hélder 
inequality (see e.g. Rudin (1970, page 62)) we get 


Pae(C, Pu) = (1 — pu)” 2 alga) : 


O (=p) “ l i 
e a aL FA 


(1—p) a iyuc 1—u 
Ee copy yaaa) } (24) 
1 “cag Hiii 
eea (M —1) 














Solving for Pae(C, p), the theorem follows. 





Theorem 2.47. Let M = [q®"] and 


R 


Sa e 


log, (q 


— log,(4 — 1) 
Then 
p na(n,R) E 
Pae(n, M, p; q) > (M — IE (1 — p) 0720A), 
Proof. From the definition of a = a(n, R) we get 
gi Fal/n =f 
q—1 


Gee = 
which in turn implies 
1 


Rn-n _ 2 : 
qehe werd 


(2.44) 


Let 
R 


alogia) 
and define pu by (2.43). Then 


Rie p Oe. Pu 
3 Aea Ge 
and so 
TEEST ae 
Pu Te AFT 
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By (2.44), this implies that 


ght" = 21 —= py)”. (2.45) 
We have M = [q®"] > q?" > M — 1 and so 
—Rn 
ia eos 


Combining all this with Lemma 2.7 and Theorem 2.43 we get 


pen zoo ta-o) +a} 


gM a-p" mł 
tr (Cia ane! 


Lo —1)-*{(@- 1) (2) +n)" 

qe -(1 - pyr} 
spe e a + 0-2)" 
soad a) om” 


Theorem 2.48. Letn, 2 < K<M andpe [o, oa be given. Then we 


have 


n 
u 


n 
u 














M-K+1;/ p \4K) 
Pao(n, M, p; > (+) 
(n, M, p;4) 2 — T a 


Proof. Let C be an (n, M) code and let 
E = {(x,y) € C? | 0 < dy(x,y) < d(n, K)}. 

We consider C the vertex set and E the edge set of a graph. Let F C C be 
an independent set (no two vertices in F are connected by an edge). Then 
du(x,y) > d(n, K) 
for all x,y € F. By the definition of d(n, K), this implies that #F < K-1. 

A result by Turan (see: Ore (1962, Theorem 13.4.1)) implies that 


(1 _ pro) i 





M-kK+1)M 
#E> (M-K+1)M 
K-11 
Hence 
1 p \¢a(xy) aye 
Pae(C, p) = — (=) (1 — p)?- dey) 
(yer T 
M—K+4+1/ p \4K) be ies 
2 MORN (2 yg yaar 
K-11 \q-l 











Error detecting codes for the q-ary symmetric channel 89 


2.6.2 Upper bounds 


We now turn to upper bounds. From Lemma 2.1 we immediately get the 
next theorem. 


Theorem 2.49. For any (n, M,d;q) code C and any p € lo, 1] we have 


Pae(C, p) < (u — 1) (0 -= p)". 


The bound can be sharpened using upper bounds on A;. The proof is 
similar, using Lemma 2.2. 


Theorem 2.50. Let C be an (n, M,d; q) code and p € [o, m]. Suppose 
that A;(C) <a; ford <i< n. Letl be minimal such that 


Saen 
i=d 
Then 
Pep) < F a(t- (M-15 a) (25) a-p. 
i=d i=d 


Example 2.13. Consider an [8,4,3;2] code. By Theorem 2.49 we get 
Pae(C, p) < 15p°(1 — p)*. 

Using the bounds on A; given in Theorem 1.22 we get 
Ag < |=| =9, Ay<14 

and so 


Pae(C,p) < 9p3(1 — p)? + 6p*(1 — p)*. 


q 


Pae(C, p) < (E ians (*) Cee + > (a-p. 
i=l ina 


In particular 


Theorem 2.51. Let C be an [n,k,d;q] code and p € lo, a]. Then 


Pael(C, p) < (1 — p)" — (1 — p)” 


for any |n, k; q] code C. 
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Proof. Let G be a generator matrix for C. Since equivalent codes have 

the same probability of undetected error, we may assume without loss of 

generality that G = (zQ) where Q is some k x (n — k) matrix. We have 
wH(xXG) = wH(x) + wg (xQ) > wH(x). 


Also wy(xG) > d. Hence 


D wH(xG) ps et 
P(Cp)= > ra (Spree 
x€GF(q)*\ {0} 
pe = 
< ye (Gar (1—p)"¢ 
xE€GF(q)k 


1<wy(x)<d 


EE aeo 


-1 
x€GF(q)k q 
dtl<wy(x)<k 


In particular 
n— p wu (x) —wy(x 
Ponting h So (oe) (ap ese 
xEGF(q)*\ {0} 


= (1-p)"*(1-(1-p)*). 














Remark 2.7. If C is the code generated by (re) where Okx (n-k) 
is the k x (n — k) matrix with all entries zero, then 


Pae(C, p) = (1 — p)"™* — (1 — p)”. 
Hence, Theorem 2.51 is best possible for |n, k, 1; q] codes. 
Example 2.14. If we use Theorem 2.51 we get 
Pac(C,p) < 14p°(1 — p)? + p*(1 — p)* 


for an [8,4,3;2] code, and this is weaker than the bound obtained from 
Theorems 1.22 and 2.50. In other cases, Theorem 2.51 may give a better 
bound. For example, for a [14,5,3;2] code C, Theorem 2.51 gives 


Pae(C,p) < 25p?(1 — p)™ + 5p*(1 — p)? + p°(1 — p)’, 
whereas Theorems 1.22 and 2.50 give only 
Pae(C, p) < 30p°(1 — p)™ + p*(1 — p)”. 
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In Example 2.3, we gave the distance distribution of all (5,4; 2) codes. 
As shown in that example, Pye(C,p) < (1 — p)? — (1 — p)® for all these 
codes, linear and non-linear. However, for non-linear (n,q*;q) codes C in 
general, we may sometimes have Pae(C, p) > (1—p)"~* — (1 — p)”. We will 
illustrate this with an explicit set of codes. 


Example 2.15. Let k > 4 and n = (që —1)/(q—1). Let C be the (n, q"; q) 
code containing the zero vector and all vectors of weight one. Then it is 
easy to see that 


Ai(C) = q(q-1)n/q*, 
A2(C) = (q-1)?n(n — 1)/4*, 
Therefore i 
f(p) = Pac (C, p) = z(a = Dn 





a) a 


+(a-1)?n(n-1)(—2_)'a-yr*} 


= Ffo =p) + n(n- 1)p? (1 - pyr}. 


Computing the derivative and evaluating it for p = (q — 1)/q, we get 





f( =) E -G = T) (q 1)(q”% 2g! — gk + q’). 
Let g(p) = (1 — p=” — (1 S Then 
-1 
i} = agen — (q — 1)kq" + (q — 1)°). 
Hence 


aa ares) 
q q 
1 
= k _ —2 2k 2/_,k = 1 2 a) 
recat q—k—2q)q +q (q — 1) +24 >0 
if kq — k — 2q > 0. This is satisfied for k > 4 and all q > 2 (and also for 
k = 3 when q > 3). Hence 


(471 (4—1 
Sg ow ae a 
for k > 4. Since (=) = (5) this implies that f (p) > g(p) when p 
is close to (q — 1)/q, that is, 
Pae(C, p) > (1 — p)" — (1 — p)” 


when p > pq,k for some pq,k. For example, p3,4 œ~ 0.1687502258. The first 
few values of po, are given in Table 2.15. 
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Table 2.9 The first few values of pz, in Example 2.15. 


k 4 5 6 7 8 9 10 


P2,k 0.28262 0.15184 0.08380 0.04677 0.02618 0.01463 0.00814 


It is an open question in general to determine 
max{Pue(C,p) | C is a (n, M; q) code} 
for given n, M,q,p. 
The average results clearly give upper bounds on Pye[n, k, p; ql. 


Theorem 2.52. If K is any [n,«;q] code and 1 < k < r, then 


k 





zaj 
Pac(K, p) 


Paen, k, p; q] < — 
qr -1 


for all p € [o, s]. In particular, for all n> k > k > 1 we have 








god 
Pre[n, k, p; q] < —— Preltt, 6, p; dl 
Further, for all n > k > 1 we have 
k 
q —1 n 
Pueln, k, p; q] < Po ={1- (1—p) is 


and for alln > k > 1 we have 





Pamina s F—={1 + (@—(1- 4p)" - apy". 


Proof. The main result follows from Theorem 2.28, and the special cases 
by choosing K = GF(q)” and K = {(%1,20,...,2n) | yi, zi = 0}, 
respectively. 














There are sharper bounds in some cases. 


Theorem 2.53. For any integers n > k > 1, any p € lo, 1] , anyu € 


(0, 1], and any prime power q we have 
k 


Pel pial < (L—{ (@-0()"+0-9")"-a-2"}) 


q 


el- 
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Proof. Let C be an [n, k; q] code. Using the natural isomorphism between 
GF(q)” and GF(q”) we may consider C as a subset of GF(q"). For any 
non-zero g E GF(q”), the set 

gC = {gx |x C} 
is again an [n, k; q] code. For convenience we write 


k 1 
q — 1 p “ u\” un % 
J=($5 u-d) +0-9") -0-9 
qv —1 q-1 
the right-hand expression in the theorem. Define ¢(g) by 
C(g) =0 if Puc(gC,p) < J. 
Note that ¢(g) < Folge) for all g. We will show that ¢(g) = 0 for at least 
one g. First we show that 








PE a wales), 
w< y (en) (2.46) 


wy(gz) 
) “ (1 — p)’-“#(9") > J for some x € C \ {0}, then (2.46) is 


a 


if (2 
clearly satisfied. On the other hand, if (4 
for all x € C \ {0}, then 
p wu (gx) 1 n—wy (gx) 
Pal) y (a) ia 


c(9) < — > = 
a zeC\{0} d 


(1 — p)" YRU) < J 


C. — p)" Ya (92) ú 


2eC\{0} 
and (2.46) is again satisfied. 
If x € C \ {0}, then gx runs through GF(q”) when g does. Hence 
Eanes D B aope me 
gEGF(q”)\{0} g#0 wEC\ {0} 
1 p m 


J xEC\{0} g#0 god 


-+ D (ooe) 


xEC\{0} 


20 Ti ( Ga) nay aap 
J q-1 


=q" l1. 


(is p) “OVa (g2)) 
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Therefore ¢(g) = 0 for at least one g 4 0. 


Lemma 2.8. If R= & and 





log, (q= wn p(R)) 
= ~ og, EST 
then 
EHG a (GYM 


Proof. For this value of u we have 


(4-10 -/R)) _ ec —1)(1 ay 
p(R) p 


u ah) 
and so (q — 1)(4) +(1-p)*= a Further, 


R-1 —H,(p(R)) _ Ge ra _ (R) A, 








q =q al 
Hence 
rs {e(a -o 5) +a- y 
= t s TES "su zay 
= {aya = amy = e 1% 
= We -1)(1- A a 
aot p(R) 
q (A s anes 
= (ya zp] 











Combining Theorem 2.53 and Lemma 2.8 we get the following theorem. 


Theorem 2.54. If p € [o, 1 and R = È < C(p) = 1 — H(p) (that is, 
p(R) 2 p), then 


p np(R) 
Pye[n, k, p; q] < (=) (1 — p), 
i 
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2.6.3 Asymptotic bounds 


It turns out that Pae(n, M, p; q) decreases exponentially with n. Therefore, 
define 


log, Pae(n, [g”"] , p; q) 


Tue(R, p; q) F lim SUP Tue (n, R, p; q), 

Tye(R, p; q) = lim inf tue(n, R, p; q), 
and 

Tue(R, p; q) = lim Tue(n, R, p; q) 
when the limit exists. A lower (resp. upper) bound on Pae(n, M; q) will 
give an upper (resp. lower) bound on Tuel(n, R, p; q). 
From Theorems 2.48 we get the following bound. 

Theorem 2.55. For 0 < Ry < Ro < 1 we have 


Tue(R2, p; q) < —d(R1) tal a) — (1 — ô(R1)) log; (1 — p) + Rı = R2. 


From Theorem 2.44 we can get upper bounds on 7ye(R, p; q). The sum 
contains |log,(M)] +1 terms. For the term for i = wn we get 


kon ((7 Jaeno G 


where 
f(w) = wlog,(w) + (1 — w) log, (1 —w) —(R+w—1) 
Pp qp 
The function f(w) takes its minimum for w = -4 and the corresponding 


minimum is 1— R. The smallest value for w with 7 in the summation range 
is w = 1 — R. Hence, if Ta < 1— R, then f(w) > f(1— R). This gives the 
following theorem. 
Theorem 2.56. (i) For 1— z < R< 1 we have 
Tue(R, piq) <1- R. 
(ii) Foro < R<1- oT we have 
qp 


Tue(R, pi q) < R{log,(R) — log, (1 - o 


HEA R) {log,(1 - R)— ion S): 
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Theorem 2.57. For 0 < R < C(p) we have 


p 
Zue(R, p; a) > —P(R) log, (77) ~ (1 - p(R)) log, = p). 
For0 < R< 1 we have 

Tuc(R, piq) 2 1-R. 


Proof. The first bound follows directly from Theorem 2.54. From The- 
orem 2.43 we get 
Ping p; q) < Pal k, p; q] Sa" 
and so 
log, Pae(n, [q?”| »D; q) 
n 


for 0 < R < 1. This proves the second bound. The two bounds coincide 
when R = C (p); the first bound is stronger when R < C(p). 


<R-1 














Combining Theorems 2.56 and 2.57 we get the following theorem. 


Theorem 2.58. For 1 — oa <R<1 we have 
Tue(R, p; q) =1-R. 


Any upper bound on 6(R) will give an upper bound on 7ye(R,p). The 
best upper bounds on 6(R) known is the LP-bound (see e.g. MacWilliams 
and Sloane (1977, Chapter 17)). Using any upper bound on 6(R), the first 
of the bounds of Theorem 2.55 is better for small values of p, the other for 
larger values of p. 

Results that are sometimes stronger than Theorem 2.56 are known. 
They are given for the binary case and we list those here, but omit the 
proofs. To formulate the results, some additional notation is needed. Let 


pr Ry = 4) Be 1) (L= HU). 


N 


Theorem 2.59. For 0 < R < Rupı(p) we have 
Tuel R, p; 2) < 1 — R — H2(ôLp1(R)) + T(ôLpı (R), p). 
For 0 < R < Rrp(p) we have 
Tue(R, p; 2) < 1 — R— H2(ôLp(R)) + T(ôLp (R), p). 
For min{ Ripi(p), Rup (p)} < R< 1 we have 
Tue(R, p;2) =1— R. 
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2.7 Optimal codes 


2.7.1 The dual of an optimal code 


Theorem 2.60. If C is an [n,k;q| code which is optimal for p, then C+ 
q—1—qp 


is optimal for . Moreover, 





n q—1-—qp a j 
Pyeln, k, p; q] = q? (1 — p)” Pac [rnb ad poh Spy 


Proof. Theorem 2.7 implies that if Pue(Ci,p) < Pae(C2, p), then 


P. (o, 1) sP (o), 
q — qp q — qp 











and the theorem follows. 





Since ie runs through [o, =t when p does, we have the following 


corollary. 


Corollary 2.12. If C is a code which is optimal for all p € [o, a , then 


so is C+. 


2.7.2 Copies of the simplex code 


Let k and s by any positive integers. In Subsection 1.2.2, we described the 


simplex codes and the corresponding generator matrices I'w. Let C be the 
jo 
q 


7k; q] code generated by the matrix obtained by concatenating s 
copies of I’. This code has minimum distance d = sq®7t. From Theorems 
2.42 and 2.49 we get 
P q n— s(q* = 1) 
Puo(C,p) = (a -1)(—*5) (1 — p) tapel ZI 

Hence the code C is optimal. In particular, for s = 1 we get the simplex 
code whose dual is the Hamming code. Hence the Hamming codes are also 
optimal. 

In the next chapter (in Section 3.2) we describe the optimal binary codes 
of dimension four or less. 


,k,p;q|. 


2.8 New codes from old 


As illustrated in the previous chapter, there are a number of ways to con- 
struct new codes from one or two old ones. For many of these constructions, 
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the new code may not be good even if the old is/are. We shall give a num- 
ber of examples which illustrate this. The *-operation is atypical in this 
respect. It gives a systematic way to make long good codes and we discuss 
this construction first. 


2.8.1 The *-operation 


Theorem 2.61. Let C be linear |n, k; q| code. 
(i) If C is good, then C* is good. 
(ii) If C is proper, then C* is proper. 
k—1 


q geal 
Proof. Since (4) (l-—p) = * ig increasing on lo, 1] and 
p geo k-11 
Pye(O",p) = (F) O-o) T Puel, p), 


(ii) follows. Further, 
k-1 E E es 
(=) a =p) < (-) as 
q-1 q 


Theorem 2.61 shows that we can make longer good codes from shorter 
ones. However, there is a stronger result which shows that starting from 
any code and using the *-operation repeatedly, we will eventually get a 

















and so (i) follows. 





good code, and even a proper code. 


Theorem 2.62. If C is an [n,k,d;q] code and r > max{0, (q — 1)n — qd}, 
then C™* is proper. 
Proof. Ifd > tn, then C is proper by Theorem 2.15. Consider d < 


in. Then r > 0 and C™ is an [n+ rŠ, kd + rasa] code. If 





-1 Boi 
derg > 2 intri}, 


that is, r > (q — 1)n — qd, then C™ is proper by Theorem 2.15. 

















If r > max{0, (q — 1)n — q}, then the condition on r is satisfied for all 
d > 1. Hence we get the following corollary. 


Corollary 2.13. If C is an |n, k;q] code and r > (q— 1)n— q, then C™ is 
proper. 
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We can get the weaker conclusion that C’™* is good under a condition 
on r that is sometimes weaker. 


Theorem 2.63. If C is an [n,k;q| code and r > (q—1)(n—k), then C™ 
is good. 


Proof. By Theorem 2.51, Pye(C,p) < (1— pert =r — pth. Hence, 


k-1_ 


Pe(O,p) = {(<Pq)” 0-2 V Pal) 
k= k-1 


<{(4)' a-y a-pa- a-p} 


q—1 





= (4) P - py EHO fy = (1 —p)*}. 
q-1 
Clearly, 1 — (1 — p)* is increasing on [0, (q — 1)/q]. IE r > (q— 1)(n— k), 
then 
k-1 _ 1 


=ij 
rgt > i 7 fata +r a 


k-1 ge a kya : y 
and p"@ ‘(1 — p)” at +) is also increasing on [0, (q — 1)/q]. Hence 
we get 


e ey (+)"} 
q q 


ky -1 

= ie = Rare) 
n+r 4t 

q 


qi q 





IA 


Prue (ce ’ p) 

















In Remark 2.7 following Theorem 2.51 we gave an [n,k;q] code C for 
which Pye(C,p) = (1 — pyr kf -(1- p)*}. It is easy to check that for 
this code and r = (q — 1)(n — k) — 1 we get 


train) _ (Ey E Cg 
dp p=(q-1)/q q-1 
Hence, C”* is bad. This shows that the bound on r in Theorem 2.63 cannot 
be improved in general. It is an open question if the bound on r in Theorem 
2.62 can be sharpened in general. 

As a final remark, we note that if (q — 1)(n — k) > (q — 1)n — qd > 0, 
that is, 








-1 -1 
c ge ay, 
q q 
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then Theorem 2.63 is weaker than Theorem 2.62. 
From Corollary 2.13, we get the following theorem. 


k k 
Theorem 2.64. For any q, k, andn > qa ag =) there exists a proper 
[n, k; q] code. 


Proof. Let n = r2=) +v where 0 < v < Č} —1 andr > qt—1-¢. 


First, consider v > k. Let Co some [v, k; q] code. Since 


gk —1 
q-1 








rz -1-4=(4-1)( 1) -42 4- 1v-4, 


Corollary 2.13 implies that Cő* is a proper code of length n. 


Next consider v < k. Let Cı be the oe +v code generated by the 


g 


that is, [, concatenated by v distinct unit vectors. The weight distribution 
of C4 is easily seen to be: 


matrix 


A= 1, 


k-v 
Agr-1jq-y) =" 1, 
V 


A(qt—1)/(q—-1)+i = (Ja = Ligh @ for 1 < a < V. 











l lines qe-1_ 
Since both (4) Ti (lp) = = and q” — (1 — p)” are increasing on 














(0, (q — 1)/gq], C1 is proper. Hence Cus is a proper code of length n. 


We believe that proper (n, M; q) codes exist for all q, n and M, but this 
is an open question. 
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2.8.2 Shortened codes 


Consider a systematic [n,k;q] code C and let C® be the shortened 
[n — 1, k — 1] code: 


Ce = {x E€ GF(q)"" | (O|x) € o}. 


Again the shortened code may be good or bad, whether the original code 
is good or not, as shown by the examples in Figure 2.1. 





C5 proper C® bad 
100 1001 
C proper 010 0100 
001 0010 
Ac(z): 14+ 3243274 23 1 +22 +22? 4223+ 24 
Acs(z): 1+2z +2? 1+2z+ 2? 
Ac(z): z —22 + 23 + 24 
1000000 100111 
C bad 0101011 010000 
0010111 001000 
Ac(z): L+z+3244+325 14224274 24+2254 2 
Acs(z) : 1+324 1+2z+ 2? 
Ac(z): z —22? — z2? + 224 + 225 


Fig. 2.1 Examples of some shortened binary codes. 


We note that 


—1 k-1 _1 k1 —1 
Bela ——) -/ __- < l aBa =}. 
q q” q” q 





It is natural to ask if Pye(C*,p) < Pue(C,p) for all p € lo, st), In 
terms of the weight distribution this can be written as 
Ac(z2) = { Ac(2) = 1} =i 2){ Ace (2) = 1} >0 


for all z € [0,1]. The examples in Figure 2.1 show that this may or may 
not be true. For these examples, q = 2, the codes are linear and they 
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are given by their generator matrices. Clearly z > 0 for z € [0,1]. But, 
=z? 423424 < 0 for z € (0, Z), and —2z? — z3 + 2z4 + 225 < 0 for 
z € [0,0.87555]. 





2.8.3 Product codes 


Let Cı be an [n1, kı, d1;q] code and C2 an [n2, ke, da; q] code. Let C be 
the product code. This has length n = nınz, dimension k = kık2, and 
minimum distance d = dıd2. There is in general no simple expression for 
Ac(z) in terms of Ac, (z) and Ac,(z). 


Theorem 2.65. Let dı and dg be fixed. The product code C of an 
[n1,k1,d1;q] code Cı and an |n2,k2,d2;q| code Cy where nı > kı and 
nı > na > ke is bad if nı is sufficiently large. 


Proof. First we note that 
n — k = nins — kı k > nina — (nı — 1) (n2 — 1) =n F N2 — 1 > nı. 


Rough estimates give 
ue(C, 2) 
rey a ‘(on y) (1- g 
(Gap) 0-3) 
n/dyd 
aig) elon) 


2 (as) mp) 
1 USI (ina) 
if n > 2d. Since q™ is exponential in nı whereas (nın2)f is polynomial, 
the theorem follows. 


IV 

















2.8.4 Repeated codes 


Let C be an (n, M, d; q) code and let r > 1. From the definition of C”, we 
see that 


Acr(z)= { Ac(2)} 
In particular, 


Aal”) = rAalC). 
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Theorem 2.66. If C is an (n,M,d;q) code where 1 < M < q”, and 
0<p<1-— wi, then 
Pue(C", p) 
Pist@?; (q = 1)/q) 


Proof. The condition 0 < p < 1 — ai is equivalent to (1 — p)” > i 
Hence 





— œ when r —> œ. 





Pool" p) 2 rAa(C)(@— Fe —) "= 
> MO Gam) 
= rAa(©)( qiy) Pe (o; 1) 
and so 
Fa acim? E > * 











when r — oo. 





Corollary 2.14. If C is an (n, M,d;q) and r is sufficiently large, then C” 
is bad. 


A careful analysis shows that C” is bad for all r > 5. 


2.9 Probability of having received the correct code word 


If C is an (n, M; q) and a code word from C is transmitted over a symmetric 
channel with symbol error probability p, then the probability that the code 
word is received without errors is (1 — p)”. The probability that another 
code word is received is, by definition, Pue(C,p). Hence the probability 
that we receive some code word is 


(1 ee p)” T Pae(C, p), 
and the probability that we have received the correct code word under the 


condition that we have received some code word is 


E (1—p)” 
Peorr(C, p) a (L= p)” + Pac(C,p)’ 


Theorem 2.67. Let C be an (n,M;q) code. The function Poorr(C,p) is 
strictly decreasing on [0, (q — 1)/q], 
Peorr(C,0)=1 and Por(C,(q— 1)/q) = 1/M. 
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Proof. We have 
1 _ Pac Pae(C,p) 
Paorr(C, p) 7 + Tp p)” 
IN p í 
= o Dp) 


Since CEN is strictly increasing on [0, (q—1)/q], 1/Peorr(C, p) is strictly 
increasing also, and so Peorr(C, p) is strictly decreasing. 














Any lower (resp. upper) bound on Pae(C, p) gives an upper (resp. lower) 
bound on P.orr(C, p). For example, from Theorem 2.51 we get the following 
result. 


Theorem 2.68. Let C be an [n,k;q| code. Then Peorr(C,p) > (1 — p)*. 


By Theorem 2.8, the average value of Pye(C,p) over all (n, M;q) codes 
is = + ={1 -(1- pr}. Hence, we get the following result. 





Theorem 2.69. The average value of 1/Peorr(C, p) over all (n, M; q) codes 
18 

q?-M M-1 1 

2 ae ———— 

Ge @-1(1-p) 


From Theorem 2.69 we also get an asymptotic result. 





Theorem 2.70. Let Cr n be an average (n,q®";q) code of rate R. Then 


1 forp<1—q*®1, 
Jim, Peorr(CrynsP) = a 


Proof. We see that 


(lies Sale 
Gea 
and 
M-1 1 qtr 
E Ge 


when n — oo. If qF7t < 1—p, then 1/Peorr(Cr.n; p) > 1 by Theorem 2.69. 
Similarly, 1/Peorr(Crn, p) > œ if g@-1>1-—p 
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2.10 Combined correction and detection 


2.10.1 Using a single code for correction and detection 


Let C be an (n, M;q) code. Let POC, p) be the probability of having an 
undetected error after the correction of t or less errors (after transmission 
over the g-ary symmetric channel). A more detailed discussion of PË (C, p) 
is outside the scope of this book, but we briefly mention some basic results. 

In 1.6.2 we showed that the probability of undetected error after using 
the (n, M,d) code C to correct t or less errors, where 2t + 1 < d, is given 
by 


POC.) X Au( H a- (2.47) 


j=d—t 
where 
j+t 
Ati = 5 AiNi(i, j) 
i=j—t 


and N;(i,7) is given by Theorem 1.21. 

Any question considered for Pae(C, p) may be considered for P&C, p) 
for t > 0, e.g. bounds, means, etc. In particular, C is called t-proper if 
PË (C, p) is monotonous and t-good if 


PO (C,p) < PÈ (0, 4) (2.48) 
q 


for all p € [0, 4]. 

In Figure 2.2 we give examples of generator matrices of two [12, 4; 2] 
codes which show that a code C may be proper without being 1-proper, or 
good without being 1-good. 


A simple upper bound 


Let C be an [n,k,d;q| code where d > 2t+ 1. When 0 is sent and y is 
received, then we have an undetected error, after correcting of t or less 
errors, if and only of y € Uxec\ {0} S(x). By definition, the spheres S;(x) 
are disjoint and the union is a subset of GF(q)”. Hence Az j < (") (q—- 1)? 
and we get the following upper bound. 


Theorem 2.71. Let C be an |n, k,d;q] code with d > 2t+ 1. Then 


POCZ T (")a- p)”. 


j=d-t 
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z 100011110000 
SEN 010001101111 
eee 001000010001 
000100010011 
AY (z) i 223 + 24 + 25 4+ 226 +427 + 428 + 29 
AW (2): 622 + 623 + 2424 + 2125 + 3728 
+4827 + 3328 + 1729 + 321° 
á 100011100101 
cool 010000011001 
A ase 001000011111 
000100011011 
A) (z): 223 + 224 + 25 + 226 + 227 +328 + 329 
AW (2): 622 + 1023 + 2524 + 2925 + 2326 





+3827 + 4028 + 1529 +9710 


Fig. 2.2 Examples of two |12, 4; 2] codes, given by generator matrices. 


The ratio Pp) (C, p)/Pue(C, p) 


We now consider how the probability of undetected error increases when 


some of the power of the code is used for error correction. 


For p = i we have 


In general 
(1 —p)~" Pue(C,p) = Aaz? + Aapiz t + 
and 
d= p” Pe, p)= At dezt + Api see 
where z = p/((q— 1)(1 — p)) and 


d 
At d-t = f 3 Aa 


d d+1 
At d-t+1 = (gta) {i+ t+ @-2)} Aa Gases 
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Hence, 


PEC) fd ei 
Pae(C, p) t d-t+1 


More terms can be added, but they soon become quite complicated. 


+(a-2)je+). 


Average 


One can find average results for PO (p) similar to those found for Pye(p) in 
Section 2.4. The proof is also similar and we omit the details. The result 
can be expressed in various ways, and we list some. 


Theorem 2.72. Let C be a set of codes of length n and minimum distance 
at least d= 2t+1. Then 


E w D (a-pe 


II 


P(C, p) 


wH(x)>2t+1 ye St (x) 
n itt p j 
= ig) S~ — p)” 
= © 40O P nin) -a 
i=2t+1 j=i—t 
wą (y) 
S a ES D a 
wyąH(y)>t+1 xESt(y) 


We consider one example. 


Theorem 2.73. For the set C =SYSL(n,k,2t+1) we have 
t n t \ 
(t) z Xo (3) = 1 P_\*(y _ pri 
Pie (C, p) < GO] ais 2 ; ea (1 —p) \. 
Proof. Let 
1 
B= sd 
es aie ( a2) (4-1) 
By (2.27) we have a(x) < 8 for all x of weight at least 2t +1. Hence 
p \wHly) 
ocase E (2) 


wyH(y)2t+1 ga 


a e 


i=0 1=0 


(1 — pY M45, (y) 
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2.10.2 Concatenated codes for error correction and detec- 
tion 


Two types of concatenated codes for combined error correction and detec- 
tion have been studied. 


m=] 


Let the outer code D be a [k,l;q] code and the inner code C an [n, k; q] 
systematic code. Let u(x) € GF(q)"~* be the check bits corresponding to 
the information vector x € GF(q)*, and let 


E = {(xlu(x)) | x € D}. 


Then E is an |n, l; q| code. Suppose without loss of generality that the all- 
zero code word is sent. At the receiving end we first use complete maximum 
likelihood decoding for the code C, that is we decode into the closest code 
word of (x|u(x)) € C, and if there is more than one closest code word, 
we choose one of these at random. If the corresponding x is a non-zero 
code word in D, then we have an undetectable error after decoding. Let 
Pae(D,C,p) be the probability that this happens. The question we must 
ask, how does use of the code C affect the probability of undetected error, 
that is, what is 


def Pie D,C,p 
R(D,C, p) = S 


By a careful choice of codes it is possible to get R(D, C, p) — 0 when p — 0. 
In particular, this is the case if dmin(C) > 2dmin( D). 


Remark 2.8. An alternative way of using the combined code is to correct 
up to t errors using code C and the use the remaining power to detect 
errors; this is a special case of the next method considered. In this case 
an error is undetected if the received vector is within distance t of a code 
word of the form (x|u(x)), where x € D \ {0}, that is, within distance t 
of a non-zero code word in Æ. Hence the probability of undetected error is 
(t) 
Pie (E, p). 


m>1 


Let D be an [mk, K; q] code and C an [n, k; q] code which is able to correct 
t errors. Let Æ denote the concatenated code. Suppose that the all zero 
vector in GF(q)”” is transmitted. Let (yi,y2,--: ,¥m) be received. The 
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probability that y; is within distance t of a fixed code word in C of weight 
i is given by 


n Š 
Pi) = oN (FY a-p. 
j=0 
If some y; is at distance more than t from all code words in C, then a 
detectable error has occurred. Otherwise, each y; is decoded to the closest 
code word y; E€ GF(q)”. 
Let the corresponding information vector be z € GF(q)*. If 
(Z1|Z2|--+|Zm) is a non-zero code word in D, then we have an undetectable 
error. The probability that this happens is then given by 


5 Ain sia, sim (C)p (i1 )p (i2) p (im). 


(i1,i2, = ,4m)4(0,0,-+» 0) 


2.10.3 Probability of having received the correct code word 
after decoding 


The results in Section 2.9 can be generalized to situation where we com- 
bine correction of up to t errors and error detection. Assume that C is 
an (n, M,d;q), that a code word from C is transmitted over a symmet- 
ric channel with symbol error probability p, and that the received vector 
is decoded to the closest code word if at most t < (d — 1)/2 errors have 
occurred. The received vector is decoded into the sent code word if it is 
within Hamming distance of the sent code word. The probability for this to 
happen is D (>) p)(1—p)"-4. The probability that another code word 
is decoded to, by definition, PLLC, p). Hence the probability that we have 
decoded to the correct code word under the condition that we have been 
able to decode 


So (pi — pyr 
P® (C, p) = E a E 
corr | p) D (G) = p)r-s + Pue(t) (C, p) 


It can be shown that POC, p) is strictly decreasing. 


2.11 Complexity of computing Pae(C, p) 


The following theorem shows that it is a hard problem to compute Pye(C, p) 
in general. For a generator matrix G, let Cg denote the code generated by 
G. 
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Theorem 2.74. The problem of computing Pae(Ca, p), as a function of a 
rational p and a generator matrix G, is an NP hard problem. 


Proof. It is known Berlekamp, McEliece, and van Tilborg (1978) that 
the following problem is MP complete: 


Given a k x n (binary) generator matrix G and an integer w, decide if 
the code Cg contains a code word of weight w. 


In particular, this implies that the problem of finding the weight distribu- 
tion of Cg is NP hard. We show that the problem of finding the weight 
distribution of Cg has the same complexity as the problem of evaluating 
Pae(Ca, p) in the sense that each has a polynomial time reduction to the 
other. 

First, computing Pye(Ca, p), given a rational p and the weight distribu- 
tion of Cg, is a simple evaluation of a polynomial using Theorem 2.1, and 
this can be done in polynomial time. 

Next, if we know Pue(Ca, pi) for n different values pi, p2,..., Pn (all 
different from 1), then the weight distribution can be determined in poly- 
nomial time from the set of n linear equation: 





zy Ai + 22 Ao DES ZT An = (1 s pı)” Pae(CG, p1), 
z2 Áı + z? Az seer aa zg An = (1 = p2)” Pae(Ca, p2), 











Zn Ay +224 +- + z2 An = (1 — pn)” Pae(CG, Pn), 


where z; = pi/((q—1)(1—p:)). Since the coefficient matrix of this system of 
equations is a Vandermonde matrix, it has full rank and the set of equations 
determine Aj, A2, ... Án uniquely. 














2.12 Particular codes 


In this section we list the weight distributions of some known classes of 
codes. Results that apply only to binary codes are given in the next chapter 
in Section 3.5. 


2.12.1 Perfect codes 
Repetition codes and their duals 


Over GF (q) the (generalized) repetition codes are the [n, 1; q] codes whose 
non-zero code words have Hamming weight n. The weight distribution 
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function is 
1+(q-1)z2”. 


Both the code and its dual are proper. In the binary case, the dual code is 
known as the single parity check code. 


Hamming and simplex codes 


For given m and q, the simplex code is the cyclic code over GF (q) generated 
by a primitive polynomial of degree m. It has dimension m and length 
gat All non-zero code words have weight q™~!, that is, its weight 
distribution function is 


n = 





1+ (a” = 1) Pa 

The dual code is the |n, n — m; a] Hamming code. The weight distribu- 
tion can be found from the weight distribution of the simplex code, using 
MacWilliams’s identity: 


m—1_y 


n- (PGR ET) oate ae 


1=0 





Both codes are proper. The extended Hamming code and its dual code are 
also both proper. 

Estimates of the minimum distances of shortened binary simplex codes 
(the partial periods of binary m-sequences) were given by Kumar and Wei 
(1992). Whether such shortened codes are good or not will depend on the 
primitive polynomial used for generating the code and the shortening done. 
Shortening of binary Hamming codes will be considered in more detail in 
the next chapter (in Section 3.4). 


Golay codes 
The binary Golay code is the [23, 12, 7;2] CRC code generated by the poly- 
nomial 

zp ppa. 
Its weight distribution is given in Table 2.10. Both the code and its dual 
are proper. 


The ternary Golay code is the [11,6,5;3] CRC code generated by the 
polynomial 


e+ zt 2427-1, 
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Table 2.10 Weight distribution of the [23, 12, 7; 2] Golay 
code. 


a: 0 7 8 11 12 15 16 23 


Aj: 1 253 506 1288 1288 506 253 1 


Table 2.11 Weight distribution of the 
[11, 6, 5; 3] Golay code. 


a 0 5 6 8 9 11 


Ai: 1 132 132 330 110 24 


Its weight distribution is given in Table 2.11. Both the code and its dual 
are proper. 

References: MacWilliams and Sloane (1977, pp. 69, 482), Leung, 
Barnes, and Friedman (1979), Mallow, Pless, and Sloane (1976). 


2.12.2 MDS and related codes 
MDS codes 


The Singleton bound says that d < n — k + 1 for an [n,k,d;q] code. 
Maximum distance separable (MDS) codes are [n,k,d;q] codes where 
d= n-— k+ 1. For these codes 


ae i—nt+k—-1 ; 
A; = —1) ( Ope = 1) 
(J & ore 
J=) 
for n— k +1 < i< n, see Peterson and Weldon (1972, p. 72). Kasami and 
Lin (1984) proved that PË (C, p) is monotonous on (0, i) for all [n, k, d; q] 
MDS codes C and all t < 4, that is, the codes are t-proper. 
The defect and MMD codes 
The defect s(C') of an [n, k, d; q] code is defined by 
s=s(C)=n-k+1-d. 


By the Singleton bound, s > 0, and C is MDS if and only if s = 0. Similarly, 
the dual defect s+(C') is the defect of the dual code, that is 


st = sH(C)=k+1-d}. 
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Faldum and Willems (1997) proved that 


Maree) Terp kt ge -1) 


i=0 


stst—1 i y k 
l ~ (k+s—i\/fj-1l+s-i 
<n ie )( ae Jen 


>i 





Let C be an [n,k,d;q] code with defect s. Olsson and Willems (1999) 
proved that if m is a positive integer and k > m + 1, then 


pe U U in (2.49) 
q” —1 
They called a code with equality in (2.49) a maximum minimum distance 
code. A complete characterization of MMD codes were done by Faldum and 
Willems (1998) and Olsson and Willems (1999). Based on this character- 
ization, Dodunekova and Dodunekov (2002) showed that all MMD codes 
are proper, and Dodunekova (2003a) showed that the duals of all MMD 
codes are proper. 
The MMD codes with s > 1 are the following codes (and any codes with 
the same weight distribution). 


St* for t > 0, where Sj, is the simplex code. 

The [q*~1, k, (q—1)q*~?; q] generalized Reed-Muller code of first order. 
For q = 2, we must have k > 4 (see e.g. Assmus and Key (1993)). 
The [12,6, 6; 3] extended Golay code. 

The [11,5,6;3] dual Golay code. 

The [q?, 4, q? — q; q] projective elliptic quadratic code for q > 2 (see e.g. 
Dembowski, Finite Geometries, Springer, 1968). 

e The [(2¢ — 1)2™ + 2°, 3, (2t — 1)2;2™] Denniston code for 1 < t < m 
(see Denniston (1969)). 





Almost MDS codes 


Codes with s = 1 are called almost MDS codes (AMDS). In particular, if 
also d2(C) = n—k+2 (where d2(C) is the second minimum support weight 
defined in Section 1.5), the codes are called near MDS (NMDS). They 
have been discussed by de Boer (1996), Dodunekov and Landgev (1994), 
Faldum and Willems (1997). The error detecting properties of almost MDS 
codes were discussed Kløve (1995b) and Dodunekova, Dodunekov and Kløve 
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(1997). Some of these codes are good, others are not. For example, if C is 
an [n, k,n — k;q| AMDS and 


Ang < L) {1 +286 4+2,/ 82 +B}, 
q\k 


where 


then C is proper. 


Expanded MDS codes 


A symbol in GF(q™) can be represented as an m-tuple with elements 
in GF(q). Therefore, an [n,k;q’™] code can also be considered as an 
[nm, km; q] code. For some Reed-Solomon codes (as well as some generaliza- 
tions), the weight distribution of the corresponding binary codes have been 
studied by Imamura, Tokiwa, and Kasahara (1988), Kasami and Lin (1988), 
Kolev and Manev (1990), Pradhan and Gupta (1991), Retter (1991), and 
Retter (1992). In particular, Retter (1991) considered generalized Reed- 
Solomon codes generated by matrices of the form 


U1 v2 eae Un 
UVU{QAL V2QQ ... UnAn 
2 2 2 
vi&i V243 «+s Un, 
viaf! vas! fee vnak-t 
where @1,Q2,...,Q@, are distinct non-zero elements of GF(q) and 
V1, V2,+-+,Un are non-zero elements of GF(q). Retter (1991) determined 


the average weight distribution for the class of codes where the a; are kept 
fixed, but the v; is varied in all possible (q — 1)” ways. Nishijima (2002) 
considered upper bounds on the average probability of undetected error for 
this class of codes, and Nishijima (2006) gave upper and lower bounds on 
the probability of undetected error for individual codes in the class. 


2.12.3 Cyclic codes 


Several of the codes presented under other headings are cyclic. Here we 
will give some references to papers dealing with cyclic codes in general, and 
more particular, irreducible cyclic codes. 
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Barg and Dumer (1992) gave two algorithms for computing the weight 
distribution of cyclic codes. 

A cyclic code is called irreducible if the polynomial h(z) in Theorem 1.2 
is irreducible. An irreducible cyclic |n, k;q] code C can be represented as 
follows. 


{ (Ti), TH (ws), a ie aa) | rE GF(4")}, 


where Tr is the trace function and £ is a root of h(z) = 0. 

McEliece and Rumsey (1972) and Helleseth, Kløve, and Mykkeltveit 
(1977) gave general methods for computing the weight distribution of ir- 
reducible cyclic codes and also gave explicit expressions for several infinite 
classes of such codes. Their papers also contain a number of further refer- 
ences. Segal and Ward (1986) and Ward (1993) have computed the weight 
distribution of a number of irreducible cyclic codes. 

One drawback with the use of cyclic codes is that they are not good for 
block synchronization. However, we can overcome this drawback by using 
some proper coset S of the cyclic code C since Pue(C,p) = Pael S, p). 


2.12.4 Two weight irreducible cyclic codes 


Baumert and McEliece (1972) and Wolfmann (1975) studied a class of q- 
ary two weight linear irreducible cyclic codes. The class is parametrized by 
three parameters, r > 1, t > 2, and s > 2, where s divides q” +1. The 
dimension and length of the code are 
Gd 

= 


k=2rt, n= 





The codes have the two weights and corresponding number of code words: 


pÈ DeD A 


w Zn, 


Aw, = n(s — 1). 


wi = (q— - 

k—1 t rt—1 
w2 = (q — Npa 

The error detecting properties for these codes were studied by Dodunekova, 

Rabaste, Paez (2004) and Dodunekova and Dodunekov (2005a). They 

showed that the codes and their duals are proper in the following cases: 


q=2,5s>2,teven,r>1, 
q=2,s=3,todd, r > 1, 
g=3,s=2,t>2, T>, 
g=3,s=4,t=2, r=1. 








In all other cases, both the codes and their duals are bad. 
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2.12.5 The product of two single parity check codes 


Leung (1983) determined Ac(z) for the product of two binary single parity 
check codes. If Cy and C2 are [l,l — 1] and [m,m — 1] parity check codes 
respectively, then the weight distribution function of the product code is 


1 S (m-1 itn 
gitm-1 D i (z +z ) g 
1=0 
The product code is proper for 


(t, m) € {(2,2), (2,3), (2,4), (2,5), (3, 2), (3,3), (3, 4), (3, 5), 
(4, 2), (4,3), (4,4), (5,2), (5, 3)}, 


in all other cases the product code is bad. 


2.13 How to find the code you need 


There are two main classes of problems we may encounter when we want 
to use an error detecting code on the g-SC: 


e how to calculate Pye(C,p) or POC, p) for a given code, 
e how to find an (n, M;q) code or [n,k;q] code for which Pae(C, p) or 
(t) ? : 
Pac (C, p) is below some given value. 


In general these are difficult problems and a solution may be infeasible. 
However, in many cases of practical interest, solutions may be found using 
the results and methods given in this and the next chapter. Since the results 
are many and diverse, we give in this section a short ” handbook” on how 
to attack these problems. 


Given C and p, how to find or estimate Pye(C,p) 


The first thing to do is to check in Sections 2.12 and 3.5 if your code is in the 
lists of codes with known weight distribution. If not, the weight distribution 
of the code or its dual may possibly be found by a complete listing of the 
code words (if k or n — k are small). Having found the weight distribution, 
Pye(C, p) can be computed from Theorem 2.1. If you have found the weight 
distribution of the dual code, Pue(C,p) can be found combining Theorems 
2.1 and 2.7. Likewise, PY (C, p) can be computed from (2.47) if the weight 
distribution is known, and from Theorem 1.14 and (2.47) if the weight 
distribution of the dual code is known. 
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If the weight distribution is not possible to obtain, you have to be sat- 
isfied with estimates for Pue(C,p). One upper bound is given in Theorem 
2.51. Another is obtained by combining Theorems 1.22, 1.23 and 2.50. Any 
partial information you may have about the weight distribution may help 
to improve the bound thus obtained. If p < d/n, Theorem 2.2 can be used. 
Lower bounds may be found using Theorems 2.42-2.48. Upper bounds on 
P$ (C, p) is obtained by combining (2.47) and Theorems 1.22 and 1.23. 


Given C, a, and b, how to find or estimate Pyc(C, a,b) 


To determine Pwe(C,a,b) exactly is usually a harder problem than to de- 
termine Pye(C, p) for a particular p. In the special cases when we can prove 
that the code is proper, and b < (q — 1)/q, then Pye(C,a,b) = Pae(C,b). 
Likewise, if b < d/n, then Pyc(C,a,b) = Pue(C,b) by Theorem 2.2. If the 
code is good, then by definition Py.(C,0,(q — 1)/q) = Pue(C, (q — 1)/q) = 
(M — 1)/q”. 

With known weight distribution of the code, the algorithm based on 
Theorem 2.35 can be used to get as good an approximation to Pyc(C, a,b) 
as we want. Otherwise, any upper bounds on the weight distribution (e.g. 
from Theorems 1.22 and 1.23) can be combined with Theorem 2.35 to give 
upper bounds on Pyc(C,a,b). The last resort is to hope that Pyc(C, a,b) 
is about average and that the upper bound on the average (2.41) is also 
an upper bound on Py<(C,a, 6) (this is not very satisfactory, cf. Theorem 
2.391). 


Given p and a bound B, how to find an (n, M; q) code C such that 
Pue(C,p) < B 


It is possible that B is too small, that is, there are no (n, M; q) codes C 
with the required property. The first thing to do is therefore to compare 
B with the lower bounds in Theorems 2.42-2.48. If B is not smaller than 
any these lower bounds, the next thing to do is to look through the list of 
codes in Sections 2.12 and 3.5 to see if any of those satisfy the requirement. 
If not, possibly a code with the requirements can be obtained using the 
*-operation one or more times on any of these codes. If k or n — k is not 
too large, the next possibility is to pick a number of (n, M; q) or [n,k; q] 
codes at random and check them. 

The problem of finding the [n, k;q] code which minimizes Pae(C, p) is 
usually harder, but the line of attack would be the same. The solution 
for k < 4 and for n — k < 4 is given in Section 3.2. Further, if n = 
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s(q* —1)/(q—1), then the optimal code for all p is C—))*, where C is the 
[((q* — 1)/(q — 1), k; q] simplex code. Further, the [(q™ — 1)/(q — 1), (¢™ — 
1)/(q — 1) — m; q] Hamming codes are also optimal for all p. 


Given a, b and a bound B, how to find an (n, M;q) code C such 
that Pue(C, a,b) < B 


This problem should be attacked in the same way as the previous problem, 
the main difference is that now it is Pye(C, a, b) which needs to be computed 
or estimated. As for a lower bound, note that Pye(C,a,b) > (M —1)/q” if 
a < (q— 1)/q < b. 


How to choose a code which can be used for different sizes of the 
information, but with a fixed number of check bits 


The problem can be stated as follows: find an |m + r,r;q] code such that 
the shortened [k + r, k; q] code is good for all k within some range, possibly 
for 1 < k < m. The main choices of codes for this problem are CRC 
codes. You should consult the sections on CRC codes and, if need be, the 
references given there. 


2.14 The local symmetric channel 


A channel is called a local symmetric channel (LSC) if it behaves as a q- 
ary symmetric channel for each transmitted symbol, but the channel error 
probability may vary from symbol to symbol. We consider linear codes 
over GF (q). Suppose that an [n, k; q] code C is used on a LSC where the 
probability that symbol 7 is in error is given by p;. A similar argument 
as for the q-ary symmetric channel gives the following expression for the 
probability of undetected error. 


Pye(C, p1, p2,°** Pn) 
= [lt -a {el e o) 


i=l 
q 
= Ace (1- TE ae = [[G- pa. 
1 


ax 





4 


Example. Let C be the [n,n — 1; 2] even-weight code, that is, C+ = {0,1}. 
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Then 


Ag (215 225 °*+ Zn) = 1+ 2122+: 2a; 


and so 


n 


Pue(C, p1, P2,°** , Pn) = {1+ TI 2p:)} Jfa 


tl 





In analogy to what we did for the q-SC, we define 
Pwc(C, a,b) = max Pae(p1, p2, vn) | pi € [a,b] for 1 < i < n}. 
Theorem 2.75. Let C be an [n,k;q| code, and let 0 <a<bK<1. For 
X C {1,2,...,n}, define z(X) =z by 
ae . ifi €X, 
í aifig X. 
Then 
Pye(C, a,b) = max Pae(C,2(X)) | xc (annt. 
Proof. For convenience, we write 
Q={pla<pi <b forl<i<n}. 
We prove by induction on j that there exists a vector y E€ Q such that 
yi € {a,b} for 1<i<j, (2.50) 
Pae(C, p) < Pue(C,y) for all p € Q. (2.51) 


First, let j = 0. Since Q is a closed set and Pye(C,p) is a continuous 
function of p on Q, it obtains its maximum on Q, that is, there exists a 
y € Q such that (2.51). Moreover, (2.50) is trivially true for j = 0. This 
proves the induction basis. 

For the induction step, let j > 0 and suppose that the statement is true 
for j — 1; let y € Q such that y; € {a,b} for 1 < i < j — 1 and (2.51) is 
satisfied. For a c € C we have 


C a ea 


wu (ci) i 
= s A #4 i (1 = yi) wale) if Cj # 0, 
wu (ci) 
v) [lig (2s 7) PE = yji wle) if cj = 0. 





p 
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Hence 


Pae(C, y) = QjYj + bj 


where 








= yy II ar ee 


ceC\ {0} if; 
cj=0 


js Ge) aa 


ceC\{0} ij 
cj=0 


q 


If a; > 0 we must have yj = b since y is a maximum point, and so (2.50) 
is true. Similarly, if a; < 0 we must have yj = a and again (2.50) is 
true. Finally, if a; = 0, then Pae(C,y) is independent of the value of yj. 
Therefore, we may replace yj by any other value in [a,b], e.g. a which 
makes (2.50) true also in this case. This proves the statement for j and the 
induction is complete. For j = n we see that (2.50) and (2.51) implies that 
there exists an X such that Pwc(C, a,b) = Pae (c, z(X)). 














Lemma 2.9. Ifa = 0 and b = 1, then 


_ Hy €C| xy) =X} 


Pue(C,2(X)) q R 


Proof. In the case we consider, z(X) is given by 


Toe 
Zi = 








oifig X 
and so 
iS ees) 1 if wg (ci) = z; = 0 and c; = ch, 
( : -) (1—z,) vs) = $ 1/(q— 1) if wale) = zi #0 and ci £ c, 
= 0 otherwise. 

Hence 

I(aj = [1a DAF EXC) = X, 

mr a 1 i 0 otherwise. 











Summing over all c € C \ {0}, the lemma follows. 
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Theorem 2.76. Let C be an [n,k,d;q| code. Then 


1 
Pye(C,0, 1) = Gee 
Proof. First, let c be a code word in C of minimum weight d, and let 
X = x(c). Then x(ac) = X for all a #0. Hence 

q—1 1 
Pxc(C,0,1) > Puo(C,2(X)) 2 Ge Gee (2.52) 

On the other hand, let X be the support of some non-zero code word. Let 

Bx = {ex |c € C and x(c) = X} 
where cx denotes the vector of length #X obtained by puncturing all 
positions of c not in X (the elements in the punctured positions are all 
zero). Let Ux be the vector space generated by Bx. Let r be the dimension 
and ô the minimum distance of Ux. Clearly, d < ô. By the Singleton bound 
we have 6 < #X —r+1. Hence r < #X — (d — 1). By Theorem 1.23, 
Ux contains not more than (q — 1)#*~‘4-») code words of weight #X. In 
particular, #Bx < (q—1)#*-(-)), By Lemma 2.9 we have 


ABren EE 1 
Pie > X = NSE < So. Seo 
(Gaye Gage Gene 
By Theorem 2.75 we have 
Pre(C, 0,1) = max{ Pye(C 2(X)) | Ved S n}} 2 
V5 , = 14> y = (q-1)@1 





This, together with (2.52), proves the theorem. 











Lemma 2.10. Consider the interval [0, (q — 1)/q]. We have 
#Hy € C\ {0} | xy) € X} 
Pre (C,2(X)) = s 


Proof. First we note that if i € X, then z; = (q — 1)/q, and so 
Zi 





Therefore 





Z; wu (ci) 1 
4 p= i l1— wy (ci) ie 
( = D ( zi) q 
for any value of c;. If i X, then z; = 0 and so 


( Zi yea E ziji vnli) = a if c; = 0, 





Hence 





II = ac _ g) enle) = a if x(y) © 
A1\G-1 0 ifx(y)Z 


Summing over all non-zero y in C, the lemma follows. 


X, 
X. 














122 Codes for Error Detection 


For X C {1,2,...,n}, let 
Vx ={c €C| x(c) cX}. 


We note that Vx is a vector space, i.e. Vx is a subcode of C. If the 
dimension of Vx is r, then 


#{y ©C\ {0} | xy) CxX}=a"-1 





and 
Hence 
ie! 
Pre (C,2(x)) aa (2.53) 


Moreover, for each r, 1 < r < k, there exists a set X such that #X = dp 
and dim(Vx) = r. Combining this with Theorem 2.75 and Lemma 2.10 we 
get the following result. 


Theorem 2.77. For an [n,k;q| code C we have 





= qal 
Pue(C,0,(q~1)/4) = max |}. 


Next, we state a lemma whose simple proof is omitted. 


Lemma 2.11. Let 1, m, and t be positive integers. Then 


qi -1 git#—1 





(i) 7” o ge (2.54) 
io 1 it _ 1 

(i) Do > DO foru>t+1. (2.55) 
q” q™ u 


We can now restate Theorem 2.77. 


Theorem 2.78. Let C be an |n, k, d; q] code. Let 





s=max{r 1<r<kandd, =d, +(r-1)}. 


Then 


q°—-1 
Pwc(C,0, (q — 1)/4) = gT 
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Proof. First, since ds — dr = s—r for 1 < r < s, we have, by Lemma 
2.11 (i), 








g AEN 1 q? ee, 1 
< 
q” q's 
forl<r<s. Fors<r<k we have, by (1.18), that dy — ds >r—s+1, 
and by Lemma 2.11 (ii) we get 
qË paee 1 q — 1 
qt q” 
Hence, by Theorem 2.77, we get 
— 1 S_] S_] 
Puc (C, 4) = — = 4. 
q qe qers 














We give some corollaries of Theorem 2.78. Usually only part of the 
weight hierarchy is needed to determine Pwe(C,0, (q — 1)/q), sometimes 
only dı. For instance, by (1.17) we have d2 > dı + 1 if dı > q+1. Hence 
we have the following corollary. 


Corollary 2.15. Let C be an |n, k, d; q] code with minimum distance d > q. 


Then 
-1 et 
Pee (C,0, ——=) =. aes 
q 


From Theorem 2.77 we see that 


—1 kif Boy 
Pac(C,0, —=) > 4 ea 

q q** q” 
In particular, Pwe(C,(q — 1)/q) = (q! — 1)/q” if and only if n = dk = 
dı + k — 1, that is, if and only if the code is MDS. 











Corollary 2.16. For an |n, k;q| code C we have 


— 1 k] 
Pac(C,0,—) > 4 
q q” 





with equality if and only if C is an MDS code. 


Remark 2.9. The average value of Pue(C, pi, p2,---,Pn) over all [n, k; q] 
codes can be shown to be 
qt -1 n 


E(Pue(C, P1, P2,- -> Pn)) = — {i-e -} 


n 
q i=l 





and so 


max{ E ( Pae(C, pı, p2, TE w) | 0<p < (q- 1)/a} = = 5 
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In analogy to what we did for the g-SC, we could call a code good if 
Pwe(C,0,(q¢ — 1)/q) = (q* —1)/q". However, we note that this is a very 
strong condition, which by Corollary 2.16 is satisfied only by MDS codes. 


It is often difficult to determine the weight hierarchy. Therefore it is 
useful to have good bounds. Corollary 2.16 gives such a lower bound. 
Theorem 2.78 and Lemma 2.11 a) give the following general bounds. 


Corollary 2.17. Let C be an [n,k,d;q] code. Then 


k-1) 


q-1 q-1\_ 7—1 q-q 
7 < Pee(C,0, =) Sari gt 
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the example to prove it given here is simpler. However, Kløve (1984a) 
gives the stronger result that there exist codes C with arbitrary large 
minimum distance and the property given in Theorem 2.39. 

2.6. Theorem 2.42 is essentially due to Korzhik (1965). 
Theorem 2.43 is due to Leontev (1972). 
Theorem 2.45 is due to AbdelGhaffar (1997). A simpler proof of the 
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Some papers discussing various such systems are Harvey and Wicker 

(1994), Seshadri and Sundberg (1994), Yamamoto and Itoh (1980). 
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Chapter 3 


Error detecting codes for the binary 
symmetric channel 


For q = 2, the q-ary symmetric channel is called the binary symmetric 
channel (BSC). In this chapter we give some results that apply for the BSC, 
but which does not generalize to general q or a possible generalization is 
not known or has not been studied. Since q = 2 throughout the chapter, 
we (usually) drop ”; 2” from the notations and write [n, k] for [n, k; 2], ete. 


3.1 A condition that implies ” good” 


For linear codes with code words of even weight only, the following theorem 
is sometimes useful to show that 


Pae(C, p) 22 "414 — 2p)” — 2(1 — p)” }. (3.1) 


Since 1 + (1 — 2p)” — 2(1 — p)” is monotonically increasing on [0,1/2], this 
is a stronger condition than C being good. 

Since all code words have even weight, the all-one vector 1 belongs to 
C+. Therefore, for any code word x € C+ of weight i, there is a code 
word in C+ of weight n — i, namely x + 1. Hence we can write the weight 
distribution function of C+ as follows: 


Ag.(z) = >> Bi( + 24), 
i=0 
where v = [2]. 


Remark 3.1. If n is even, then B, = $A,(C*). For i < % we have 
B; = A;(C+Ł). 
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Theorem 3.1. Let C be an |[n,k,d] code where n = 2v +1 is odd and all 
code words have even weight. If 


eal) eer) 


for 
(n—2d+)? +n 


d 
2 2n i 


<j< 
then 

Pye(C,p) < 2°” {1 + (1 — 2p)” — 2(1 — p)”} 
for all p € [0,5]. 


Theorem 3.2. Let C be an [n,k,d] code where n = 2v is even and all code 
words have even weight. If 


eof EGC 


(n—2d+)?+n-2 
2n—2 i 


for 


NI& 


SJs 


then 





Pae(C, p) < 2*-” {1 + (1 — 2p)” — 2(1 — p)"} 
for all p € [0, 4] : 
Proof. We sketch the proof of Theorem 3.1. By Theorem 2.4 


Pae(C, p) = 2"-* Ag. (1 — 2p) — (1 — p)” 
= 24" {1+ (1 — 2p)” — 2(1 — p)” — F(p)} 





F(p) = (2"-* — "De. {(1 — 2p)? + (1 — 2p)"-*} . 


We will show that F(p) > 0 for all p. We see that 


(1—p)" =(1—p) 5 (0 — 2p)” 


j=0 


Error detecting codes for the binary symmetric channel 131 


and 
i n—i k j itj j v—j 
EE 2 ("2 ) Bia — apy 
j=0 
Hence 
F(p) =(1—p) Xp” — 2p)’ F;, 
j=0 
where 


v—l1 é 4 
= V ; a+ 9 

F; = (2r-* —2 -5 22tl Bpi. 
p= \(%) 2j 


i=j 
We will show that F}; > 0 for all j. By assumption, this is true for d <j < 
do [e=], Consider j < $. Then 


y—1 > f n i 2 r 
re 1+3 B, =2 0 B; i+j+v-n _92j+1 j+v 
i=j 2j ne i=0 2j 2j 





2j r n r 2 

Pan f(gtyv-n ~ a 2j+1 [J FY 
= B;{,) —2” 

> 2j—1 py G) ca 


2 


2j ,. f 
j IFV=N\an-k-if[” ajtifJIrYU 

= 2 — 279 
Sa en) 


= sa) — 221+1 Sa 
j 2j J? 


where the second to last equality follows from Theorem 1.3, and so 


p= 25 (IF”) a”) 59, 
2j j 


Finally, consider j > dọ. We prove that F}; > 0 by induction on j. As basis 
for the induction, we use that Fy,-1 > 0. We observe that B,_; = 0 for 
n—dt <i<v-—1. Further 
j i+j . Ree ; 2(j—1)j+1 (i+(j—1)j 
PI) _ i+ dj + PD go) 
ee re? 
22 as re 3) 


iz - j—1)j 
(571) 
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for j > dọ and j <i<v—d+. Hence 


Fj len: v—dt eerie) 
TR aa 
v—dt 92(5- cou E a) 
> (2"-* — 2) — ys i — ey 
i=j-1 j-1 
Py-1 





=a 


by the induction hypothesis. 














Remark 3.2. If 


d a (n —2d+)} +n 
2| 7 2n , 
then there are no further values of j to check and similarly for n even. 


Hence we get the following corollary. 


Corollary 3.1. Let C be an [n,k,d| code where all code words have even 
weight. If n is odd and 


o 5 (n= 2d +n 


2 2n 


or n is even and 


d x (n—2d+)?+n-2 
7 2n — 2 


then 


Pae(C,p) < 2°" {1 + (1 — 2p)" — 2(1 — p)"} 
for allpe€ [0, 4]. 


3.2 Binary optimal codes for small dimensions 


It is an open question for which n and k there exist [n, k] codes which are 
optimal for all p € [0, 4]. However, for k < 4 such codes do exist for all 
n; by Corollary 2.12 the same is true for k > n — 4. Before we can prove 
this, we must first show a couple of lemmas. The proof is quite long and 
we divide it into some lemmas. The proof will be based on Corollary 1.7. 
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Let C be an [n, k] code generated by G whose column count function is m. 
Then 


s(U°,m) 

r p 

PCD =0 -0 D (E) > (3.2) 
UESk,k—1 P 


Lemma 3.1. If C is optimal for p € (0, 4), then m(0) = 0. 


Proof. Suppose that m(0) > 0. Let y be an arbitrary non-zero vector in 
GF(2)*. Define m’ by 


m'(0) = 0, 
m'(y) = m(y) + m(0), 
m' (x) = m(x) for x € GF(2)* \ {0, y}, 


and let C” be a corresponding code. Then 


é wh _ J s(U°, m) + m(0)) ify € U9, 
CO) ify g U°. 
Hence 
p m(0) p s(U°,m) 
Pre C; —Pre CO; = (1-p)"41- (4) (4) > 0, 
(C,p) (C",p) = (1-p) E 2 Tap 











where the last sum is over all U such that y € U € Sk,k-1- 





Let T be an invertible linear transformation on GF(2)*, and let Cr 
be a code corresponding to mo T, that is, if mr corresponds to Cr, then 
mr(x) = m(T(x)). 


Lemma 3.2. We have Pue(Cr,p) = Pae(C, p) for all p. 


Proof. We have 


s(U,mr)= Y m(T(x)) = J m(T(x)) = s(TU, m). 
xEU T(x)ETU 
Since TU runs through Sk ,k—ı when U does, we get 
ACp ast yee sa y gr ACs) 
UESk,k—1 UESk,k—1 














Lemma 3.3. If C is an optimal |n, k] code for p € (0, +) andk <4, then 
m(x) — m(y)| < 1 for x,y € GF(2)* \ {0}. 
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Proof. We give the proof for k = 3. For k = 1,2 the proof is simpler, for 
k = 4 more complicated, but similar. For the details of the proof for k = 4 
we refer to Kløve (1992). 

To simplify the notation, we write 


mı = m(100), m2 = m(010), see m7 = m(111). 
First we note that there exists a linear transformation on GF(2)? such that 
mr (100) < mr(x) < mr(010) for all x € GF(2)? \ {0}, 


and mr(101) < mr(011). 
Hence by Lemma 3.2, we may assume without loss of generality that 
mı < Mmi < mz for1<i<7, 


m5 < meg. 
Note that this implies that mz — mı > me — m5. By Corollary 1.7 we get 
Ac(z) =1+ gmitmz+ms+me i gmitmetmatm7 + gms tme +ma+my 


M1TM5 tmz tmy 


motme+m3+m 
+z 4 z™2 6 3 7 


























fz™M1I TMs TMZ Tma Af germs mera 


Suppose that the lemma is not true, that is, C is optimal, but m2 — mı > 2. 
By Lemma 3.1, mo = 0. 


Case I, mz —m, = Mmg — ms. Define m by 
my = mı +1, M = Mm — 1, Ms = m5 +1, me = me — 1, 
and ñ; = m; for i € {0,3,4,7}. Then Mñ; > 0 for all i. Let C be a 
corresponding code. Then 


Ac(z) = Ag(z) = gmaitmstms+m7 (1 ms Zz) (1 = ee?) +0 


for z € (0,1). Hence Pae(C, p) > Pae(C, p), contradicting the optimality of 
C. 


Case IT, mz — mı > mg — ms. Define m by 
Mı = mı +1, M = m -—1, 
and m; = m; for i € {0,3,4,5,6,7}. Again ñ; > 0 for all i. Let C be a 
corresponding code. Then 
Ag) = Agl Ser mams tme Cg) (1 — ig eas ms aa) 
4zmitms+matme (1 = z) (1 = a aa) 


>0 
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for z € (0,1). Again Pae(C, p) > Pae(C, p), contradicting the optimality of 
C. 


For an [n, k] code C for which m(x) > 0 for all x 4 0, define C7 to be 
an [n — (2* — 1), k] code corresponding to m~ defined by 
m~ (0) = 0, 
m` (x) = m(x) — 1 for x 4 0. 
Then (C~)* = C and 


gk-1 gk-1 


Pp (1 — p) -1 Pae(C7, p) = Pae(C, p). 














Lemma 3.4. Let C be an optimal |n, k] code for p € (0, +) where k < 4. 
Then 

(i) C* is optimal for p, 

(ii) if n>2* —1 +k, then CT is defined and it is optimal for p. 


Proof. Let Cı be an optimal [n + 2* — 1, k] code for p. By Lemma 3.3 
C1 is defined. Hence 


gk-1 


k-1 
pP (L1- p)? "Pae(C,p) = Pae(C*,p) 
= Pae(C1, p) = p a -— p)” IP uelCT, p) 


and so Pae(C*, p) = Pae(C1, p), that is, C* is optimal for p. This proves 
(i), and (ii) is similar. 














By Lemma 3.4, for k < 4 it is sufficient to determine the optimal [n, k 
codes for p € (0,1/2) where n < 2¥—1+k. This can be done by a computer 
search. The search gives the following result. 


Theorem 3.3. If1 < k < 4 and n È> k, then there exists an [n,k] code 
which is optimal for all p € [0,1/2]. 


We write n = r(2* — 1) + no where 0 < no < 2} — 1. Then the column 
count function for an optimal code is given by 
mo(x) = r + mo(x) 
and 
Ao(z)-1= z" {Ao(2)- 1} 


where mo and Ao are given in Tables 3.1-3.3. Note that in the tables, x is 
written as a column vector 
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Table 3.1 mo(x) and 
Ao(z) for k = 2 


no x Ao (z) 
101 
011 
0 000 4 
1 100 24 2z 


2 110 142242? 


Table 3.2 mo(x) and Aop(z) for k = 3 





no x Ao(z) 
1010101 
0110011 
0001111 
0 0000000 8 
1 1000000 4+4z 
2 1100000 2 + 4z +22? 
3 1101000 1+3z+3z? +23 
4 1101001 1 +622 +24 
5 1111100 142274423 + 24 
6 1111110 14 423 + 324 


3.3 Modified codes 


3.3.1 Adding/removing a parity bit 


Consider an [n, k] code C containing some code word of odd weight. Adding 
a parity bit, that is extending each code word a to (a| 57i_, a;) gives an 
[n + 1, k] code C°” where all the code words have even weight. The weight 
distribution function of C is given by 
[| 
ou. Ail 
Aces(2)= > (Azi-1 + Azi(C))2” = 3 tAc(z) + Ac(-2)}- 
i=0 
The code C may be good without C' being good and vice versa. The 
various combinations of possibilities are given in Figure 3.1 which gives a 
generator matrix and the weight distribution function of the corresponding 
codes. 
Puncturing the last position of C°” we get C. From the examples above 
we see that a code may be good without the punctured code being good 
and vice versa. 
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Table 3.3. mo(x) and Ao(z) fork = 4 
































no x Ao (z) 
101010101010101 
011001100110011 
000111100001111 
000000011111111 
0 000000000000000 16 
1 100000000000000 8 +8z 
2 110000000000000 4 + 8z +42? 
3 110100000000000 2 + 6z +622 + 223 
4 110100010000000 1+ 4z + 6z? +423 + 24 
5 110100010000001 1 +1022 +524 
6 110100110010000 1+ 3z? +823 +3z4+2ô 
7 110100110010100 14723 4+ 724427 
8 110100110010110 1+ 1424 + 28 
9 111110011000011 14+ 624+ 825 + 28 
10 111111011000011 1+4+224+4+825+47%4z 
11 111111011100110 1+ 6z5 + 6z +2274 28 
12 111111011100111 1 +1226 + 328 
13 111111111111100 14426 + 827 + 328 
14 111111111111110 1 +827 +728 





3.3.2 Even-weight subcodes 


Consider an [n, k] code C containing some code word of odd weight. The 
even-weight [n,k — 1] subcode C° is the set of code words in C of even 
weight. The weight distribution function of C® is given by 


Lz] 
Ace(z) = X` Aai(C)z”. 
1=0 


Figure 3.2 illustrates the various possibilities. 


3.4 Binary cyclic redundancy check (CRC) codes 


Consider a polynomial 


1 


glz) = 2" + gm-12" Fe +gız +1. 


The [n = k+m, k| code Cy,n generated by g(z) is a cyclic redundancy check 
(CRC) code. In general, the optimal choice of g(z) will depend on k and p. 
We describe the case m = 16 in more detail since a number of such codes 
are used in practice. 
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C°? proper C@ bad 
10 1000001 
C proper a 0100001 
0011111 
Ac(z): L+2z+27 14327+325+27 
Acex(z) : 1+ 32? 1+ 327+ 326 + 28 
100 1000 
Cae ce Gra, 
Ac(z): 14+ 22+ 2? 14+ 22+ 2? 
Acex(z) : 1+ 32? 1+ 32? 


Fig. 3.1 Codes and extended codes 


C° proper C° bad 
1000 11000 
C proper 0100 01100 
0011 00111 
Ac(z): 1 +22 +22? +223 +24 1432743294 2° 
Ace(z): 1 +222 + 24 1 +32? 
1000 10000 
C bad 0100 01000 
0010 00100 
Ac(z): 1+32+ 327+ 23 14+ 3243274 23 
Ace(z) : 1+ 32? 1+32? 








Fig. 3.2 Codes and even-weight subcodes 


Examples of CRC codes used in standards are the codes generated by 
the following polynomials of degree 16: 
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IEC TC57 z216 4 ld le pl 29 4 28 4 eh 4 ett et, 

IEEE WG771 z186 4 z144 z213 +21 4210 29 4 28 478 4 2d gd, 

CCITT X.25 gi6 4 7121 p54 7 
zZ 
z 








sey ee ey 





ANSI 16 15 2 1 
IBM-SDLC 











? 
16 ol. NG TS pT of ee PSG aie 


An example for m = 32 is the ISO 3309 with polynomial 


72 4 728 4 28 4 22 1 16 | 12 td 10 8 a Ha Haaa. 





In practice, a chosen g(z) will be used for a range of k, and the best 
choice will depend on the criteria we use. 

Methods for efficient computation of the weight distribution of CRC 
codes has been given by Fujiwara, Kasami, Kitai, and Lin (1985), Miller, 
Wheal, Stevens, and Lin (1986), Castagnoli, Brauer, and Herrmann (1993), 
and Chun and Wolf (1994). 

For p sufficiently small, a code with larger minimum distance is better 
then one with smaller minimum distance. A candidate for an interesting 
generator polynomial is therefore a polynomial such that the minimum 
distance of the corresponding [k + 16, k] codes is large for a large range of 
values of k. Castagnoli, Ganz, and Graber (1990) considered the choice 
of g(z) from this point of view. For example, the polynomial z!° + z1 + 
zl 4 gt + 294 284 274 244241 (used in IEC TC57) generates codes 
with minimum distance at least 6 for all lengths up to 151, and no other 
polynomial has this property. In Figure 3.3 we list similar polynomials for 
other values of dyin (we write only the coefficients in the polynomial). 

Based on an exhaustive search of all polynomial with m = 16, Castag- 
noli, Ganz, and Graber (1990) gave the optimal choices presented in Figure 
3.4. In the table, ne is the largest length for which dmin(Cgn) > 2. A sum- 
mary of the main properties of the codes is also given. For more details, 
we refer to Castagnoli, Ganz, and Graber (1990). 

Miller, Wheal, Stevens, and Mezhvinsky (1985) and Miller, Wheal, 
Stevens, and Lin (1986) considered the polynomials g(z) = (1 + z)p(z) 
where p(z) is irreducible of degree 15. There are 896 such polynomials. 
They used the criterion that a polynomial g(z) is better if the bound 


Pyo(CnsP) < 2-141 = 2(1 = p)” + (1 — 2p)” } (3.3) 
is satisfied over a larger range of values n. By this criterion the best poly- 
nomial turned out to be a polynomial that was optimal also by the criterion 


used by Castagnoli et al., namely the last polynomial in Figure 3.4. For 
this polynomial, (3.3) is satisfied for all n < ne. 
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dmin = coeff. of polynomial length < 


17 11111111111111111 17 
12 10101101111101101 18 
10 11101001000101111 21 
9 11000111101010111 22 
8 10001111110110111 31 
7 10010011010110101 35 
6 10011110101100101 151 
5 10101100100110101 257 
4 11010001011101011 32767 


Fig. 3.3 Generator polynomial which generate codes of a given minimum distance to a 
given length. 


Castagnoli, Brauer, and Herrmann (1993) and Wolf and Blakeney (1988) 
have done a similar analysis of polynomials g(z) of degrees 24 and 32. 

Wolf and Chun (1994) considered an alternative model for channels 
with single bursts and the use of CRC codes to detect such burst. In 
Chun and Wolf (1994) they also describe special hardware to compute the 
probability of undetected error of CRC codes. Using this hardware they 
determined polynomials of the form (z + 1)p(z), where p(z) is irreducible, 
such that the corresponding shortened code is good for a large number of 
test lengths. They gave one polynomial for each degree from 8 through 39. 
These polynomials are listed in Figures 3.5 and 3.6. 


3.5 Particular codes 


In this section we consider the error detection of some known classes of 
binary codes. 


3.5.1 Reed-Muller codes 
The rth order (binary) Reed-Muller code of length 2” is a ce hen a 


i=0 Li 


code. The first order Reed-Muller code is the dual of the extended Hamming 
code and has weight distribution given in Table 3.4. The code is proper. 
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coeff. of the polynomial g, Ne 
properties of the codes Cyn 


10011110101100101 151 
dmin > 6 for n < ne, proper for n < ne, 


11111001010011111 258 
dmin > 6 for n < 130, dmin > 4 for n < ne, 

proper for n ¢ {43,...,48} U {189,...,258}, 
Poe(Cgn;9,0.5)/Pue(Cg,n, 0.5) < 1.042 for n < ne 


10101100100110101 257 
dmin 2 5 for n < ne, proper for n < ne, 


10000000100011011 28658 
dmin > 6 for n < 115, dmin > 4 for n < ne, 

proper when n < 1127 and n ¢ {17,...,28} U {31,..., 58}, 
Pwc(Cg,n, 0,0.5)/Pae(Cg,n, 0.5) < 2.11 for n < ne 


11010001011101011 32767 
dmin > 4 for n < ne, conjectured! to be proper for n < ne, 
(t Castagnoli et al. verified that Cg,n is proper for n < 256 


and a number of larger values of n.) 


Fig. 3.4 Generator polynomials and properties of the best CRC codes for m = 16. 


Table 3.4 Weight distribution of first 
order Reed-Muller code. 


i 0 2ml gm 


Ai 1 2™-—2 1 


The weight distribution of the second order Reed-Muller code was de- 
termined by Sloane and Berlekamp, see MacWilliams and Sloane (1977, p. 
443). It is given in Table 3.5. The code is proper for m < 5, but it is bad 
for m > 6. The class of second order Reed-Muller codes is asymptotically 
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m = degree of g(z) coeff. of the polynomial p(z) 


8 10001001 

9 101100011 

10 1000101101 

11 11100111001 

13 1101110100111 

14 10111010011001 

15 110100001110111 

16 1000011100101001 

17 11000101110110111 

18 100010010010111011 

19 1011101000100010111 

20 10010010111000010011 

21 110101101110101100011 
22 1000011100100110000101 
23 10111101110110110100011 
24 100010110000111010101011 


Fig. 3.5 Generator polynomials g(z) = (z + 1)p(z) generating CRC which are good for 
a range of shortened distances. 


bad. In fact, any infinite subset of the set {R(r,m) | r > 2, m > r+ 3} is 
asymptotically bad, see Klove (1996b). 
Table 3.5 Weight distribution of second order Reed-Muller code. 
i Aj 
0, 2” 1 


Teem-—2n41 (2-1) 


Mi (27-1 


gm-—1 + 9m—-1—-h gh(h+1) 





forl<h< lz] 


Ili=m-2h+1 (2-1) 


m— m +m m/2 
gm-1 g(m?4+m+2)/2 _ 9 _ 25E ] oh(h+1) ~ 2 
ONE i—1) 


Kasami (1971) determined the weight distribution of several subcodes 
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m = degree of g(z) coeff. of the polynomial p(z) 


25 1011100100110001111110101 

26 11100101101000010110110001 

27 100010110010010101110110111 

28 1001001010010010101001111001 

29 10001011001101101110011101001 

30 100110110000010010001000101001 

31 1110101100110110111010011111101 

32 10100010100010001100010101011001 

33 110111011100110101011110100001001 

34 1001100010111001001000010010010011 

35 10111110010001101111101000010110001 

36 110110010011100010101100101110110001 
37 1001100001001000011100101110000101101 
38 10010000110101010001110010111000000111 
39 111011111010001100110101000111100111001 


Fig. 3.6 Generator polynomials g(z) = (z + 1)p(z) generating CRC which are good for 
a range of shortened distances. 


of the second order Reed-Muller codes. 


3.5.2 Binary BCH codes 


The primitive BCH codes are defined as follows. Let œ be a primitive 
element of GF(2™). Let M;(x) be the polynomial of lowest degree over 
GF(2) having af as a root (the minimal polynomial of aî). The t-error 
correcting BCH code (for short: t-BCH code) is the CRC code generated 
by the polynomial 


lem{ Mı (a), Mo(x),..., Moz(x)}. 


The code has length n = 2™ — 1, minimum distance at least 2t + 1 and 
dimension at least n — tm. The 1-BCH code is the Hamming code. Note 
that this definition generalizes immediately to g-ary BCH codes. 

The weight distribution of the dual code of the binary 2-BCH is given 
by Tables 3.6 and 3.7. See MacWilliams and Sloane (1977, p. 451f) or Lin 
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and Costello (2004, p. 177f). Leung, Barnes, and Friedman (1979) proved 
that the binary 2-BCH and the extended binary 2-BCH are both proper. 


Table 3.6 Weight distribution of 2-BCH code for 


m odd. 
i Ai 
0 1 
gm-1 _ 9(m—1)/2 am _1)(am—-2 + 9(m—3)/2 
gm-1 (2m - 1) (a +1 


2m1 4 ofm—1)/2 (2m — 1) (gm—2 _ am—8)/2) 


Table 3.7 Weight distribution of 2-BCH code for m 


even. 


i Ai 
0 1 
gm-1_gm/2 g(m—4)/2 (2” — 1) (202/2 4 1) /3 
om—1 _ gm/2-1 gm/2( am — ; 7 +1)/3 
Pies 2m —1)(2™-2 41 
gm—1 4 9m/2-1 47/2(gm — 1) (2m/2 — 1) 
grit 4 gm/2 — Lp lm—a)/2 . m ; = = 1) 


The binary 3-BCH code is a [2™ —1,2™ — 1 — 3m] code whose dual code 
has the weight distribution given by Tables 3.8 and 3.9, see MacWilliams 
and Sloane (1977, p. 669) and Lin and Costello (2004, p. 178). Ong and 
Leung (1991) proved that for m odd the 3-BCH and the corresponding 
extended code are both proper. For m even, however, Perry (1991) showed 
that neither the 3-BCH nor the extended 3-BCH are good for m > 6. 
However, these classes of codes are asymptotically good. 


3.5.3 Z4-linear codes 


Let, as usual, Z4 denote the integers modulo 4. A linear code over Z4 of 
length n is a module, that is, a subset C of Z7 such that if u,v € C, then 
au + bv € C for all a,b € Z4 (the arithmetic is done modulo 4). The dual 
code C+ is defined in the usual way via inner product (modulo 4). Let 
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Table 3.8 Weight distribution of 3-BCH code for m odd, m > 5. 


i Aj 

0 
gm-1 _ g(m+1)/2  g(m-—5)/2 (27 = 1) (om 1 1) Ce 3)/2 4 1) /3 
gmat gim—1)/2— 9(m—3)/2(9m _ 1) (5. 2m=1 — 1) (20-d/2 +1) /3 


)( 
gm—1 (2m —1)(9-22™-44.3-2"-9 4.1 

gm—1 4. 9(m—1)/2 9(m—3)/2(gm _ 1) (5-277 — 1) (21 m— D/2 —1) /3 
gm—1 4 9(m+1)/2 9(m—5)/2 (am T 1) (Cae La 1) (20m- 3)/2 _ 1) /3 





Table 3.9 Weight distribution of the 3-BCH code for m even, m > 6. 





i Ai 
0 1 
gm—2 _ a(m+2)/2 (am — 1) (am — a) (27t + a(m+2)/2) /960 
gm—1 _ gm/2 7(2™ —1)am(gm—1 4 2m/2) /48 
gm-1 _ 9(m—2)/2 (2" 1) (oun te: 2(m-2)/2) (6 2m4 16) /15 
gm—1 (2m = 1) (29 22m _ gm+2 4 64) /64 
gmt 4 9(m—2)/2 (2r = 1) (ores = 2(m-2)/2) (6 gm 4 16) /15 
gm-1 4 gm/2 aor = 1) m (ami = ze?) /48 





2m1 4 2(m+3/2 (2m — 1) (am — 4) (2m1 — afn+2)/2) /960 


$ : Z4 — GF (2)? be defined by 
(0) = (00), (1) = (01), ¢(2)= (11), (3) = (10), 
and @: Z? + GF(2)2" by 
(v1, V2, +++ ,Un) = ($(v1)|6(v2)] ---|b(Un)). 


Finally, for a linear code C over Z4 of length n we define the binary code 
(C) of length 2n by 


(C) = {o(v) | v € Ch. 


Note that ¢(C) is not in general linear; such codes have been termed Z4- 
linear. For a more detailed description of these concepts we refer to the 
paper by Hammons, Kumar, et al. (1994). In particular they prove the 
following two results which are important in our context: 
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e A Z,-linear code is distance invariant, in particular, 


Asc) (z) = AZC) (z), 


Note that both ¢(C) and ¢(C+) may be non-linear and are not dual in the 
usual sense. 

One class of codes which can be obtained this way is the Kerdock codes 
K(m) which are non-linear (2™,2?™) codes. The distance distribution is 
given in Tables 3.10, see MacWilliams and Sloane (1977, p. 456) or Ham- 
mons, Kumar, et al. (1994). 


Table 3.10 Distance distribution, Kerdock codes. 





i A; for m even A; for m odd 
0, 27” 1 1 
gm—-1 4 9(m—2)/2 gm(gm-1 _ 1) = 
gm-ly g(m—1)/2 E gm-1 (ane 3 1) 
gm-1 gmt+1 _—2 Pi Ca + 1) £9 


The Preparata codes are another class of binary Z4-linear (2™, 2?" -2m) 


codes. The distance distribution of the Preparata code is the MacWilliams 
transform of the distance distribution of the corresponding Kerdock code. 
Dodunekova, Dodunekov and Nikolova (2004a) showed that both the Ker- 
dock codes and the Preparata codes are proper. 

The Delsarte-Goethals DG(m,d) codes, where m > 6 is even, are non- 
linear (2, 20™—-D(Lm/2]—d+)+m+1) codes. In particular DG(m,m/2) = 
K(m). The distance distribution of the DG(m,m/2 — 1) codes is given in 
Table 3.11, (see MacWilliams and Sloane (1977, p. 477)). 





Table 3.11 Distance distribution, DG(m, m/2 — 1) 
codes. 


i A; 
02” I 
gm—1 4. gm/2 gm—2 (gm—1 _ 1) (2m -1)/3 
m-l 42m/2-1  gm(2m-1 — 1\ (2m-1 + 4)/3 
gm-1 ata = 1) (ae ad gm-—2 + 1) 
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3.5.4 Self-complementary codes 


A binary linear code C is self-complementary if the complement of any code 
word is again a code word, that is, if c € C, then c+ 1 € C, where 1 is the 
all-one vector. For example, the codes considered in Section 3.1 are duals 
of self-complementary codes. For a self-complementary [n,k] code C, the 
weight distribution is symmetric, that is, A;(C) = An—i(C) for all i. For 
the study of such codes, we start with a lemma. 


Lemma 3.5. Let 





Then the function 
f(p) = pP p +p" “(1 — p) 
is increasing on [0, 1/2]. 
Proof. We have 
F'(p) = ip’ *(1 py (n i)p'(1 py 
t(n = i)p" 11 =p) ap (1 — p) 
= p™ (1 — p) (n — i — np)g(p), 





where 


1l—p\r-2% np-—i 
g(p) 5 ae 





roa e SO) ot p) — (np—i)(n-i np) } 


p p? (n—i-— np)? 
= (+ = ie 1 (n— 2%) 
OA p p? (n —i-— np)? 


1 n? -n 


2 
dengo) — Go Pea) 
We see that g'(p) has its minimum for p = 1/2. Moreover, this minimum 


is non-negative since 
e E navn) n? -n 
NE fag) 
4° = 2 2 4 


Hence f'(p) > 0 for all p € [0, 1/2], that is, f(p) is increasing. 








= 0. 


i(n — i) — 
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From the lemma we immediately get the following result. 


Theorem 3.4. If C is an (n, M,d) code with symmetric distance distribu- 


tion and d > ae then C is proper. 





Remark 3.3. If C is a self-complementary code (n, M,d) and d > navn 
that is, n — (n — 2d)? > 0, then M < Te this is known as the Grey- 
Rankin bound. Further, the condition of Corollary 3.1 is also satisfied and 
so (3.1) is satisfied. 


3.5.5 Self-dual codes 


A linear code C is self-dual if C+ = C. For a self-dual code k = n — k, 
that is n = 2k. The binary self-dual codes of dimension up to 16 has been 
classified, for some larger values of k partial classification has been done. 
The weight distribution has been determined for many self-dual codes. In 
particular, the classification and weight distributions of all binary self-dual 
[32, 16] codes was determined by Bilous and van Rees (2002). An excellent 
overview of self-dual code is given by Rains and Sloane in Pless and Huffman 
(1998) pp. 177-294. 

We note that c -c = 0 if and only if c has even weight. Hence, all 
the code words of a self-dual C code have even weight. In particular, this 
implies that the all-one vector is contained in C+ = C, and so C is in 
particular self-complementary and A,_;(C’) = A;(C) for all i. 


Example 3.1. Following Perry and Fossorier (2006b), we describe the error 
detecting capability of the self-dual [32,16] codes. From Theorem 2.10, we 
see that a necessary condition for a self-dual [82,16] code to be good is 
Ag = 0 and Ay < 2. From the tables in Bilous and van Rees (2002) we 
see that there are exactly 29 possible weight distributions that satisfy these 
conditions. These weight distributions fall into four classes given in Table 
3.12. 

The first class is proper for all b. 

The second class is proper for b < 7, bad for b = 8,9. 

The third class is good but not proper for b < 3, bad for b > 4. 

The forth class is proper for all b. 

For those codes that are proper, this can in all cases be shown using 
the sufficient condition in Theorem 2.19. The code with b = 3 in the third 
class is an interesting example of a good code with two local maxima in 
the interval (0, 1/2), namely for p ~ 0.1628 and p ~ 0.4109. Also the (bad) 
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Table 3.12 Classes of weight distributions for self-dual [32, 16] codes 


Class: 1 2 3 4 
A4 0 1 2 b 
A6 4b 4b 4b 0 
Ag 364 — 8b 374 — 8b 384 — 8b 620 + 10b 
Alo 2048 — 12b 2048 — 12b 2048 — 12b 0 
A12 6720 + 32b 6771 + 32b 6622 + 32b 13888 — 49b 
A14 14336 + 8b 14336 + 8b 14336 + 8b 0 
A16 18598 — 48b 18674 — 48b 18750 — 48b 36518 + 76b 


range: b=0and2<b<8 2<b<9 b=0and2<b<10 b= 0,1,2 


code with b = 4 in this class has two maxima. 


3.6 Binary constant weight codes 


For given n and m, let 27" denote the set of all binary vectors of length n 
and weight m. A (binary) constant weight code is some subset of 07". The 
distance between two code words in a binary constant weight code is clearly 
even. If the minimum distance is 26, we call the code an (n, M, 26, m) code. 
Note that these codes are not linear (in particular, the zero vector is not a 
code word). 

In this section we will first give some results for the codes Qg and next 
results for constant weight codes in general, that is, subcodes of Qw. The 
code Q?” is essentially the same code as 27” (we have only interchanged 
the zeros and the ones). In particular, Pye(Q?-™, p) = Pue(0?", p). There- 
fore, we will assume that m < n—™m, that is m < |n/2]. In this section we 
mainly quote known results without proofs. However, we give references 
for the results. 


3.6.1 The codes Q™ 
Theorem 3.5. Let 0<m< |n/2]. For alli, 0< j < m we have 
m\ (n-—m 
man CC”) 
25( ) j j 
For all other i we have A;(Q?) = 0. 


Proof. For any code words in 7", we obtain a code word at distance 27 
exactly when j of the m ones are changed to zeros and j of the n — m zeros 
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are changed to ones. The number of ways to choose the j ones is ("’) and 














the number of ways to choose the j zeros is UPE 


Theorem 3.6. a) For n < 4, all Q'" codes are proper. 
b) For5 < n < 8, the codes Ql? are proper. 
c) For all other n and m < |n/2], the codes QY are bad. 


The threshold was defined in (2.3). In particular, 6(Q7") is the smallest 
root in the interval (0, 1/2] of the equation Pyg(Q”,p) = Pua(Q?’, 1/2). For 
p< OC), the bound Pya(Q?, p) < Pua(Q, 1/2) is valid. 


Theorem 3.7. Let 
Ci ee 
PEAS MS E ` 
i) For all n and m such that 1 < m < n we have 
32 \ 1/4 
Y(n, m) < (=) . 


ii) For all sufficiently large n and all m such that 1 < m < n we have 


w(n,m) < A(Q™) < w(n,m) + n?°y(n, m)’ 





where 
wo(n,m) = inm) + EP yin, m). 
Corollary 3.2. We have 
lim 2/2991) =1. 
Corollary 3.3. IfO < A < 1/2, then 
lim p5/4gr (1-2 (a))/2 9(22”) = 1 
yaa (2733 (1 — )8) 1" 
where H(A) is the (binary) entropy function. 
Corollary 3.4. We have 
lim n°/49(Q"/?) = Gaye: 
n—00 T 


Let Pue(n, M, w, p) denote the minimum value of P e(C, p) over all bi- 
nary (n, M,2,w) constant weight codes. A binary (n, M,2,w) code C is 
called optimal (error detecting) for p if Pue(C,p) = Pue(n, M, w, p). 
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3.6.2 An upper bound 


Let C(n,M,w) be the set consisting of all binary (n, M,2,w) constant 
weight codes. The mean probability of undetected error for the codes in 
C(n, M, w) is given by 


_ 1 
P,.(n, M, w, p) = ——— Pye(C,p). 
(n, M,w,p) Hn, Mw) secon J (C, p) 


Theorem 3.8. 


Pye(n, M, w, p) = pis 3 a (" : o)ra 


au 





Corollary 3.5. 


Palos Miah) STP TET (1) (5 ear 


t 


3.6.3 Lower bounds 


Theorem 3.9. If C is a binary (n, M,2,w) constant weight code, then 
D 2w(n—w)M 
n n(M-1 
Pue(C,p) > (M —1)(1 — p) Coe 
Theorem 3.10. Let C be a binary (n, M,2,w) constant weight code, then 
M tee (w\ (n-—w i n—2i M A 
ODER O O a E 
hte NAN 4 (a) 
There are a couple of lower bounds which are analogous to Theorem 
2.44. 


Theorem 3.11. Let C be a binary (n, M,2, w) constant weight code. Then 
Pue(C,p) > X. max{0, Fi(n, M, w)}p' (1 — 2p)" 
l=1 
where 


oe @) arya 
Fi(n, M, w) = M e -( ) 
t=max{0,w—I} ( t N) l 
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Theorem 3.12. Let C be a binary (n, M,2,w) constant weight code such 
that 


G (a 
an IM < 
(oa (aa 
for some t, where 1 <t < w. Then 


t n=l 
Pue(C,p) > Q- p) X Boa = (7 Jota -27 


w 








l=w-t 
If D is a t — (v, k, A) block design, the rows of an incidence matrix for 


D form a constant weight code Cp of length v, size Mit, and weight k. 


Theorem 3.13. Let Cp be a binary (v,M,d,k) constant weight code ob- 
tained from at — (v, k, A) block design such that A < (n — t)/(w — t) and 
d>2(w—t). Then Cp is an optimal constant weight code for all p € [0,1/2] 


and 
Cs een 


Pue(C, p) = ee 2w DD ae Ga 7 -1 (Pra = apy" 


l 


3.7 Comments and references 


3.1. Theorems 3.1 and 3.2 are from Kasami, Kløve, and Lin (1983). 

3.2. The results are taken from Kløve (1992). 

3.3. The results and examples are taken from Kløve and Korzhik (1995). 

3.4. The referred standard CRC codes are mainly collected from the inter- 
net. 

3.5. Lemma 3.5 and Theorem 3.4 are due to Dodunekova, Dodunekov and 
Nikolova (2004a) (actually, they gave a more general version of the 
lemma). 

3.6. Theorem 3.6 was shown by Wang, Yang, and Zhang, see Wang (1987), 
Wang (1989), Wang (1992), Wang and Yang (1994), Wang and Zhang 
(1995), Yang (1989). A simpler proof was given by Fu, Kløve, and Xia 
(2000a). 

Fu, Kløve, and Xia (2000a) proved Theorem 3.7 and its corollaries. 
Theorems 3.9, 3.10, and 3.11 are due to Fu, Kløve, and Wei (2003). 
Theorems 3.12 and 3.13 are due to Xia, Fu, and Ling (2006a). 

In addition to the result listed, a number of asymptotic bounds were 
given by Fu, Kløve, and Wei (2003) and Xia, Fu, and Ling (2006a). 


Chapter 4 


Error detecting codes for asymmetric 
and other channels 


4.1 Asymmetric channels 


Let the alphabet be Z, with the ordering 0 < 1<2---<q-—1. A channel 
is called asymmetric if any transmitted symbol a is received as b < a. For 
example, for q = 2, a 0 is always received correctly while a 1 may be received 
as 0 or 1. The binary channel where 


m(0|0) =1, ~z(1|0)=0, x(O/l)=p, wO]1)=1-—p 


is known as the Z-channel. 
An asymmetric channel is called complete if 7(bja) > 0 for all b < a. 
For general q, the two main complete channels considered are the one 
where each error is equally probable and the one where the errors are given 
weight proportional to a — b. For both channels z(bja) = 0 if b > a and 
m(0|0) = 1. For a > 0, the first channel is defined by 


r(bla) = { 


and the second channel is defined by 


1—p if b=a, 
a(a—1)/2P ` 


1—pifb=a, 
p/a ifb<a. 


We note that for q = 2, these two channels are the same, namely the 
Z-channel. We first give some results for the Z-channel. 


4.1.1 The Z-channel 


Let x be sent and y. By definition, P(y|x) = 0 if y <x. If y <x, then 
Ply|x) = pren (1 — pjr da ley) — pwn) -wa (y) (1 — pva Y), 


153 


154 Codes for Error Detection 


Therefore 


Pel Zp) = a5 DD pee ene A — pew), a) 


xEC yEC 
y<x 


It is interesting to compare Pae(C, Zp) and Pye(C,BSC,) for some 
codes. If y £ x for all y,x € C, then clearly Pae(C, Zp) = 0, whereas 
Puc(C, BSCp) > 0 (if #C > 2 and p € (0,1)). For other codes, the ratio 
Pue(C, Zp) /Pae(C, BSC,) may be large or small, depending on the code 
and p. 


Example 4.1. As a simple example, consider the [2* — 1, k] simplex code 
Sk. This is a linear code where all non-zero code words have Hamming 
weight 2*-1. Hence y < x for y,x € Sp if and only if y = 0 and x Æ 0. 
Therefore 





A TAE a Lam 
and 
Pye( St, BSCp) = (2* — 1p (1 p) t, 
Hence 
Poise Z 
es = zF when p—0O, 
and 


Pae(Sk, Zp) 
Pae(Sk, BSC) 


Also, for any p < 1 we have 


Pae(Sk, Zp) 
Pue(Sr, BSC) 


=œ when p—l. 


=œ when k—oo. 


A code C is called perfect for error detection on the Z-channel if 
Pue(C, Zp) = 0 for all p. By (4.1), C is perfect if and only if y £ x 
for all y,x € C, y # x. The largest perfect C of length n is obtained by 
taking all vectors of weight |3] i 

A perfect systematic (k + [logs k],2") code can be constructed as fol- 
lows: 


Cr = {(xIr(x)) |x € Z3} 
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where r(x) € Zilese El is obtained by first taking the binary expansion of 
w(x) and then taking the binary complement (changing 0 to 1 and 1 to 
0). E.g. 

C3 = {00011, 00110, 01010, 01101, 10010, 10101, 11001, 11100}. 
We note that Cy has 2* code words whereas the non-systematic perfect 
code of the same length given above has 


(“i flog, D 7 2k 


k+logs k 
2 


One general construction of (non-perfect) error detecting codes for the 
Z-channel is given in the next theorem. 


9k 
T 


code words. 


Theorem 4.1. Let a; be integers for 1 < i < k such that 
O<ay <a <- <akInN. 
Let 
C = {x € Z3 | wg (x) = a; for some i}. 


SEE (Vere 


Qi 


Then 


Pae(C, Zp) = 


Proof. There are (2) vectors x € C of Hamming weight a;. For each of 
these there are (%*) vectors y € C of Hamming weight aj such that y < x. 
a 











Example 4.2. Let a; = 2i — 1 for i = 1,2,...,k where k = [n/2], that is, 
we take all vectors of odd weight. The corresponding code Co detects all 
single asymmetric errors (and many others), and from Theorem 4.1 we get 
after simplifications: 
E n 

Another possibility is to let a; = 2i for i = 0,1,...,k where k = |n/2|, 
that is, we take all vectors of even weight. For the corresponding code Ce 
we get 








1 1 pr /1 1 
PaoelCao Z) =+- - p” (1 2) Sei |p 
(Ce, %) = 5 + 5(1-P) "+ (5-5)? 
In particular, #C, = #Ce and Pye(Co, Zp) < Pue(Ce, Zp) for all n > 1, 
p#F0. 
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4.1.2 Codes for the q-ary asymmetric channel 


For x = (%0,2%1,---,@k-1) € ZE, let 
k-1 
w(x) = > xi, the weight of x, 
i=0 
k-1 
u(x) = » (q—1-— xi), the coweight of x. 


S 
ll 


Clearly, wg(x) + uq(x) = k(q — 1). 
For non-negative integers a and s, where a < qf, let 


(@)s = (@o,@1,---,@s—1) E Z$ 


where 
s—1 
a= So ara, a; E Zq. 
i=0 
Define 
s—1 
wala) = Wq((a)s) = X ai. 
i=0 


For example, if q = 5 and k = 6, then 
ws5((1,3,1,0,2,4)) = 11 and u5((1,3,1,0,2,4)) = 13. 


Further, (33)4 = (3, 1, 1,0) since 33 = 3+1-5+1-5?+0-53, and w4(33) = 5. 
For integers a and n, let [a]n denote the (least non-negative) residue of 
a modulo n. 
For integers m > 0 and n > 0, let S(m,n) denote the number of arrays 
in Zy of weight n. 


Theorem 4.2. For a complete q-ary asymmetric channel, a maximal per- 
fect code of length m is 


{x ay ite | W(X) = pau k 
The size of this code is S(m, [=42]). 


Generalized Bose-Lin codes (GBL codes) is a class of systematic codes. 
They are determined by four (integral) parameters, q > 2 (symbol alphabet 
size), k (number of information symbols), r (number of check symbols), and 
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w where 0 < w <r. We use the notations p = r—w, o = S(w, |w(q—1)/2]), 
0 = q’, and u = 08. Finally, let 


{bu 0; but, pS »Du,o-1}; 


the set of vectors in Zg of weight ||. 

A code word is a vector in Lee It consists of an information part x 
with k information symbols concatenated by a check part c(x) with r check 
symbols. Let u = u(x). The check part, which depends only on [u],,, will 
be determined as follows. Let a = |[u],,/6|. Then 0 < a < o and 


ful, = 08 + [udo 
The check part is defined by c(x) = (c1(x)|co(x)), where 
a(x) = bua and c2(x) = ([u]o) p- 


As usual, an undetectable error occurs if a code word is sent and the 
received array is another code word. We characterize the undetectable er- 
rors. Assume that (x|ci(x)|c2(x)) is the sent code word and (y|ci(y)|c2(y)) 
is the received code word, where y Æ x. This is possible if and only if 


y CX, (4.2) 
ci(y) = c (x) (4.3) 
since w,(ci(y)) = w¢(ci(x)), and 
caly) G c2(x). (4.4) 
Suppose that (4.2)-(4.4) are satisfied, and let 
u = Uq(x) (4.5) 
and 
ju — X= Uq(y) — u(x), (4.6) 


where j > 1 and 0 < ÀA < pu. Note that À = [ug(x) — ug(y)],. Let 
[u], = a0 + [u]o and [u — A], = BO + [u — Alo. 
Since [u — A], = [u + (ju — A), we get 


ci (x) = bua €2(x) = ([u]o)p, 
c1 (y) = bu,g c2(y) = (lu — Alo). 
Hence, (4.3) is satisfied if and only if 


B=a, (4.7) 
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and (4.4) is satisfied if and only if 


(u—Alo)p © (to) p. (4.8) 


We observe that if A > 0, then a 4 £, and if [uljg < A < 0, then (4.8) 
cannot be satisfied. On the other hand, if A < [u]9, then (4.7) is satisfied; 
further (4.8) is satisfied exactly when (A), C ([u]o)p. We also note that 


(A)p C (lulo) implies that A < [u]ọ. Hence, we have proved the following 
result. 


Theorem 4.3. The code word (x|c(x)) can be transformed to the code word 
(y|c(y) Æ x|c(x)) by transmission over the q-ASC if and only if 


y Cx and (Xo E ([uq(x)]e)o, 


where = [tg(x) — ua(Y)Iy- 


We now consider the minimal weight of an undetectable error. From 
the proof of Theorem 4.3, we see that the weight of the undetectable error 
considered is 





Wq(x|c(x)) — wa(yle(y)) = w(x) — wa (y) + wa(c2(x)) — wa(c2(y)) 
= Wq(x) — Waly) + wa(([uJo)p) — wa( (lu — A)]o)p) 
= ju — A+ wWa((A)p) 
= ju — A+ wad). 





Suppose that 


p-1 p—1 
A= be ag, and X = 5 ag, 
i=0 i=0 


where a;, a, € F and (A}p C (A'}p, that is, a; < aj for all i. Then 


p— 
(A! — wal) — (AÀ — wg (A -Fu -ai —1)>0. (49) 
=0 
To 1), = (q—1,q—1,...,q—1). Hence wa((0—1})) = (q—1)p 
and ([u]o)o S (0 — 1)p. By (4.9), i F(A : C ([u]o)p, then 
A — w4(A) < [ulo — wallu]o) < 8—1 — (4 — 1)P (4.10) 
Therefore, the weight of an uncorrectable error is lower bounded as follows: 


ju (A- w(à)) > u-0+1+(4-1)p. 
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Consider the uncorrectable errors of minimal weight. We see that we get 
equality in (4.9) if and only if a = a; for all i > 1. Hence we have equality 
in both places in (4.10) if and only if 


A= 0 — € and [u]o = 0 — ô, 
where 0 <6 < e < q — 1. This proves the following theorem 
Theorem 4.4. A GBL-code detects all errors of weight up to 
(o — 1)0 + (q — 1)p, 
and there are undetectable errors of weight 
(o —1)0+(q—l)p+1. 
Undetectable errors of minimal weight occur exactly for code words of 
coweight t0 — 6 fort > 1 and0 < 8 < q— 1. For such code words, an 
error is undetectable if the weight of the error to the information part is 


u—0—e+ô where ô < e< q—1 and the last p symbols of the check part, 
namely (q—1,q—1,...,q—1,q—1—ô), are changed to (0,0,...,0,q—1-—e). 





A natural question is: given q and r, which value of w maximizes 

A(q,r,w) = (o — 1)0 + (q — 1)p. 
For q = 2 it was shown by a simple proof in Kløve, Oprisan, and Bose 
(2005b) that the maximum is obtained for w = 4 when r > 5. For q 2 3 
it seems to be much more complicated to answer the question. Numerical 
computations reported in Gancheva and Kløve (2005b) indicate that for 
q > 3, the maximum is obtained for w = 2. The computations show that 
A(q,r,2) > A(q,r,w) for 3 < q < 7 and w < 100. One can also show the 
following result. 


Theorem 4.5. Let q > 3. We have A(q,r,2) > A(q,r,1) forr > 3. For 
3<w<6 we have A(q,r,w — 1) > A(q,r,w) forr > w. 


This shows that for w < 6, the maximum is obtained for w = 2 and that for 
w > 2, the value of A(q,r,w) decreases with w. Whether this is true also 
for w > 6 is an open question. 


4.1.3 Diversity combining on the Z-channel 


For noisy asymmetric channels repeated retransmissions can increase the 
reliability, at the cost of decreasing the throughput efficiency of the sys- 
tem. One such method is called diversity combining. The scheme works 
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as follows. Let a code word x = (2#1,%2,...,%) be transmitted re- 
peatedly. Let the received word in the rth transmission be denoted by 
Xr = (Lp1,2r,2,---,Lrn). At the receiving end we store a vector z. We 
denote the value of z after the rth transmission by Zr = (Zr,1, 2r,2,--+; 2r,n): 
It is computed by 


20,1 = 0, 


Zri = MAX(Zr—1,i Vea) 
for 1 <i <n. We note that 
Zr—1i S Zr S Li. 


When the combined word z becomes a code word, this is passed on, and 
anew code word is transmitted. If the passed-on code word is different from 
the one sent, then we have an undetected error. We will further assume 
that there is a limit k on the number of transmissions of a code word, that 
is, if the combined word is not a code word after k transmissions, it is 
discarded. A special case of the protocol is a protocol without a limit on 
the number of transmissions (that is, k = o0). 

The probability that the combined word is passed on with an undetected 
error will depend on the channel, the code word transmitted x, the set 
X = Xx of code words y such that y C x (that is, y; < x; for all i and 
x Æ y), and the maximum number of transmissions k. We will here discuss 
in some detail the situation when the channel is the Z-channel (binary 
asymmetric channel) with transition probability p, and we write P(x, X; p) 
for the probability of passing on a wrong code word (an undetected error). 

We assume that x 4 0. Note that 


P(x, X;0) E 0, 

P(x,X;1)=0, f0 €X, 

P;(x,X;1)=1, f0 gX. 
Therefore, from now on we will assume that 0 < p < 1. If we introduce more 
code words into X, P, will increase, that is, if X C Y, then P(x, X; p) < 


P;(x,Y; p). One extreme case is when all y C x are code words. Then 
P =1-(1—p)”*., The other extreme is when X is empty: P(x, 0; p) = 0. 


The case when X contains a single code word 


We first consider the case when X contains exactly one code word, that 
is, there is exactly one code word y such that y C x. Let w = wa#(x), 
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u = wy(y) and d= w — u. Since Py(x, {y};p) only depends on w, u, p, 
and k, we write P(w, u; p). 

Suppose that the combined word becomes y after exactly r transmis- 
sions. The d positions not in the support of y must be in error for all r 
transmissions and the probability of this happening is pt”. The u positions 
in the support of y must become all 1 for the first time after exactly r 
transmissions. Consider one position. The probability that the 1 in this 
position is transformed to a zero in all the r transmissions is p”. Hence 
the probability that the bit in this position in z, is a one is 1 — p”. Hence 
the probability that all the u positions in the support of y are one after r 
transmissions is (1 — p")”. The probability that this happens for the first 
time after exactly r transmissions is therefore (1—p")“—(1—p"~+)“. Hence, 
the probability that the z, = y after exactly r transmissions is 


por) =p)". (4.11) 


Summing over all r we get 
P(w, u; p) = X p” — p°)" — (L-p" 1)". 
In particular (if k > d), 
Py(w,u; p) = p"(1 — p)" + O(p**). 
We can rewrite the ree for P(w, u; p): 


(w, u; p) -5 bx Vi Lip — 5 (Jene 


j=0 


k 
D Jo Ha ppt Y prev 
r=1 


k(d+j) 


= 2 (eoma -o a 


The values of P(x, X; p) when the code words in X are unordered 


We say that the code words of X are unordered if y ¢ y’ and for all 
y,y € X, y #y' €X. We observe that the event that the combined word 
becomes x after having been y and the event that it becomes x after having 
been y’, where y Æ y’, are mutually exclusive. Hence 


Py(x, X; p) = N R(x {y}; p). 
yex 
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In the special case where all the code words in X have the same weight, 
u, (and x has weight w) we get 


il- k(d+3) 


Pale, Xip) = ID (Jor a- 


Bounds on P(x, X; p) when some code words in X cover others 


In the general case when the code words in X are not unordered, to de- 
termine P(x, X; p) becomes more complex and may even be unfeasible. 
Therefore, it is useful to get some bounds. Let T be the smallest value of 
w(x) — wą(v) over all code words v and y such that v C y C x, (in 
particular, T > 2d where d is the minimum distance of the code). Define 
the set Y by 


Y = {y € X | wg (y) > wg(x)- T}. 


Then the code words of Y are independent. Hence Pk(x,Y;p) can be 
computed as explained above. Further, if pT is small, then P(x, Y; p) is 
a good approximation for P(x, X; p). We will make this last claim more 
precise. On the one hand we know that P(x, X; p) > P(x, Y; p). On the 
other hand, if the combined word becomes a code word not in Y, then after 
the first transmission, the combined word must have weight w — T or less. 
The probability that the combined word is some vector of weight w — T or 
less (not necessary a code word) after the first transmission is 


w-T w-T 
w w—j j w— w w— w 
D (a-o sa- E (Y) <0 -pT 
j=0 `^ j=0 \4 
Therefore 


Px (x, Y; p) < Pe(x, X; p) < Pe(x,¥;p) + 2”p7 (1 — p)”7. 


For the whole code C we get 


P;(C; p) = = X P(x, Xx; p). 


xEC 


1 
IC] 
4.2 Coding for a symmetric channel with unknown charac- 


teristic 


If we have to transmit over a symmetric channel with unknown charac- 
teristics, the best strategy is to do a coding which makes the number of 
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undetectable errors as small as possible for all possible error patterns. To 
be more precise, let us first consider a channel transmitting binary symbols. 
Let C be an (n, M; 2) code. If x € C is transmitted and y € C is received, 
then e = y +x is the error pattern. The error is undetected if (and only 
if)y=x+eeC. 
For each error pattern e € Fy’, let 
Q(e) =#{xe C|x+e€ Ch, (4.12) 


that is, Q(e) is the number of code words in C for which e will be an 
undetectable error. The idea is to choose C such that 


Q(C) = max{Q(e) | e € Fy’ \ {0}} 
is as small as possible. 

For channels with q elements, where q is a prime power, we consider 
the same approach. We assume that F, = GF(q) and when x € GF(q)” is 
submitted and we have an error pattern e € GF(q)", then x +e € GF(q)” 
is received. 


4.2.1 Bounds 


For a code C C GF(q)” and an error pattern e € GF(q)”, let 
Q(e) =#{xe€ C|x+e€ Ch, (4.13) 
and 
Q(C) = max{Q(e) | e € GF(a)” \ 0}. (4.14) 
Theorem 4.6. If C is an (n,M;q) code and M > 2, then Q(C) > 1. 


Proof. By assumption, C has two distinct code words, x and y, say. Let 
e= y —x. Then Q(e) > 1. Hence Q(C) > 1. 














Theorem 4.7. If C is an (n,M;q) code, where q is even, and M > 2, then 
Q(C) >= 2. 


Proof. By assumption, C has two distinct code words, x and y, say. Let 
e=y —x. Then x— y =e. Hence Qe) > 2 and so Q(C) > 2. 














Theorem 4.8. If C is an (n, M;q) code, then M(M —1) < (q” — 1)Q(C). 


Proof. The number of pairs (x, y) of distinct elements in C is M(M-—1). 
For each of the possible q” — 1 error patterns e € GF(q)” \ {0}, there are 
at most Q(C) pairs (x,y) € C? such that y — x =e. Hence M(M — 1) < 
(q” — 1)Q(C). 
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We will first consider systematic codes C for which Q(C) = 1 and q is 


odd. Since 
go < gE lq = 1) < eo a 


we get the following corollary. 


Corollary 4.1. If C is a systematic (n,q*;q) code and Q(C) = 1, then 
n > 2k. 


4.2.2 Constructions 
Construction for q odd 


Let q be an odd prime power. There is a natural linear bijection F from 
GF(q)* to GF(q*): let a € GF(q*) be a primitive root (that is, at 4 1 for 
1<i<q*—2and atl = 1) and define F by 
F : (%0,21,...,0p-1) Œ T = T0 + T10 + T20? +--+» + a*-l gy 1. 
Note that F(z0,£1,...,£ķ-1) = 0 if and only if (£o, £1,...,£k-1) = 
(0,0,...,0). Define the (2k, q"; q) code by 
C = {(£0, £1; ..-, Ek—1; Y0; Y1,- -- Yk—1) | (£0, £1,- --, £k—1) € GF (q)*} 
where (Yo, Y1,---;Yk—1) is defined by 
F(yo, Y1,- --,Yk—1) = F(a0,%1,.--;£p-1)?. 
Mapping C into GF(q*)? by F, let the image be C, that is 
C = {(x, x°) | x € GF(q*)}. 
This is clearly a systematic (2,¢q";q*) code over GF(q*). An error pattern 
(e0,€1,---,€k—1, fo, fi» ---, fk-1) maps into (e, f) € GF(q*)”, where 
e = F(e9,€1,..-,en-1) and f = F(fo, fi,.--, fk-1), 
and 
(€0,€1,---,€k-1, fo, fi,---, fk-1) Æ (0,0,...,0) 

if and only if (e, f) # (0,0). i 

Let (e, f) # (0,0) be an error pattern and suppose (x, x7) + (e, f) € C, 
that is 

(+e)? =r +f. 
Since (x + e)? = xz? + 2xe + e?, this implies that 
Ine + e* =f. (4.15) 

If e = 0, (4.15) implies that f = 0, that is (e, f) = (0,0); however this is not 
the case for an error pattern. Hence e Æ 0 and so x = (2e)~1(f — e?), that 


is, x is uniquely determined and so Q((e, f)) = 1. Hence Q(C) = Q(C) = 1. 
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Construction for q even 


Construction 1 can be modified to give a systematic (2k, q"; q) code C with 
Q(C) = 2. We give the details only for those parts at the construction and 
proof that differs. 

Let F be the linear bijection from GF(q)" to GF(q*) defined in Con- 
struction 1. Define the (2k, q*;q) code C by 


Ô = {(x,2°)| £ € GF(q*)}. 


As for Construction 1, an error pattern (eo, é€1,..-,¢@k—1, fo, fis ---, fk-1) 
maps into (e, f) where 


(eo, €1, . -,€k—1,; fo; fis . <, fk-1) £ (0,0, soe ,0) 
if and only if (e, f) # (0,0). 
Let (e, f) # (0,0) be an error pattern and suppose (x, x?) + (e, f) € C, 
that is 
(+e) = + f. 
Since (2 + e)? = x? + x?e + xe? + e°, this implies that 
setae? +e? =f. (4.16) 
If e = 0, (4.16) implies that f = 0, that is (e, f) = (0,0); however this is not 
the case for an error pattern. Hence e # 0. The equation (4.16) therefore 


has at most two solutions for x. Hence Q((e, f)) < 2 and Q(C) = Q(C) = 2. 


4.3 Codes for detection of substitution errors and transpo- 
sitions 


A transposition occurs when two neighbouring elements change places. For 
example, acb is obtained from abc by transposition of the two last elements. 
Transpositions are a common type of error in data that is handled manually. 
Therefore, there has been introduced many codes to detect substitution 
errors and/or transpositions, in particular for digital (base 10) data. 


4.3.1 ST codes 


An ST (substitution - transposition) code is a code that can detect a single 
substitution error or transposition error. We first give a simple upper bound 
on the size of ST codes. 


Theorem 4.9. If C is an (n,M;q) ST code, then M < qr". 
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Proof. If two vectors of length n are identical, except in the last position, 
then one is obtained from the other by a single substitution. Hence at most 
one of them can belong to C. Therefore, the code words are determined by 
the first n — 1 elements and therefore the code contains at most q”~! code 











words. 





It turns out that for all q > 2 and all n there exist (n,q”~1; q) ST codes. 
We describe some constructions of ST codes for various types of q. 


Construction of codes over GF(q) where q > 2 


If q is a prime power, we can consider an [n,n — 1; q] code C. Let the dual 


code be generated by (w1, w2,..., Wn), that is 
n-1 
C= { (1, a2,..-2n) € GF(a)” | So win: = o}. (4.17) 
i=1 


The code can detect single substitution errors if all 
wi #0 (4.18) 
(that is, d(C) > 2). We see that if xz; is changed to x;, then Sa wx; is 
changed by 
wit; + wiz, = wi (x; — x4) £0. 
Similarly, if x; and 2,41 are transposed, then the sum is changed by 
Witi — Witi Lig, + wei + Wipes = (Wi — Wi41)(Ti41 — Bi) #0 

if a; A £i}1 and 

wi £ Wi+1- (4.19) 
If q > 2, we can find w; that satisfy conditions (4.18) and (4.19), for example 
w; = 1 for even i and w; = a for odd i, where a ¢ {0,1}. 


Construction of codes over Zą where q is odd 


If q is an odd integer, we can make a similar construction with elements 
from Zp. We define 


C= TCE | X wizi =0 (mod a} 
j=l 


where now 

gcd(wi,g) = 1 and gced(wi41 — wi,g) =1 
for all i. The proof that this gives an ST code is similar. A possible choice 
for the w; is w; = 1 for even i and w; = 2 for odd i. 
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Product construction 


Let Cı be an (n, qi‘; q1) ST code over Fy, and C2 an (n, q+; q2) ST code 
over Fy,. Let q = qig2 and 


Fy = Fy, x Fa, = {(a,b) | a E€ Fy, and b E Fy}. 
Define 
C = {((a1, b1), (a2, b2),.-., (an, bn)) |a € Ci, b E€ Co}. 
Then C is an ST code over Fy. To check this, we first note that if (ai, bf) # 


(ai, bi), then ai Æ a; or b A bi (or both). Hence a single substitution will 
change a code word to a non-code word. Similarly, if (a;i, bi) Æ (@i41, bi41), 
then a; Æ aj41 or bi Æ bj41 or both. Hence, a transposition will change a 


code word to a non-code word. 


Any integer q can be written as 2™r where r is odd. If m > 2, Con- 
struction 1 gives an (n,2™"~1;2™) ST code and Construction 2 gives an 
(n,r"—1;r)ST code. Combining the two, using Construction 3, we get an 
(n,q"—1;q) code. It remains to consider q of the form 2r, where r is odd. 
This case is more complicated. 


Construction of codes over Z4 for q=2 (mod 4) 


Let q = 2r, where r > 1 is odd. Let the alphabet be Z,. For x = 
(21,225: asin) € ZF, 
Ne = Ne(X) is the number of even elements in x, 


we define some auxiliary functions: 


No = No(xX) is the number of odd elements in x, 


Se(x) = 0", (—1)’e; where e1,€2,...,€n, are the even elements of x, 
So(x) = 3072, (—1)'0; where 01, 02,...,0n, are the odd elements of x, 
Ky (&1,@2,...,2;) are the number of i < j 


such that x; is even and n — 7 is even, 
S(x) = (-1)"S.(x) + So(x) + 2Kn (xX). 
In particular, we see that 


Ky(21,%2,-..,2n-1) = Ky (a1, £2,- .., En). (4.20) 
Let 
C = {x € Z} | S(x)=0 (mod g)}. 


We will show that C is an ST code of size q”71. We break the proof up 
into a couple of lemmas. 
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Lemma 4.1. For c = (c1,C2,...Cn—1), define cn by 
—Cn = (-1)"S-(c) + So(c) +2Kn(c) (mod q). 
Then 
(clen) E€ C. 
Proof. Let x = (c|cn). First we note that Se(c) is even and Solc) = no(c) 


(mod 2). Hence cn = no(c) (mod 2) and so no(x) is even. This also implies 
that ne(x) =n (mod 2). If n.(c) is odd, then we get 


So(x) = So(e) E mS) — 2Kn(c), 
S-(x) = Selc). 
Combining these and (4.20), we see that x € C. 
If no(c) is even, we similarly get 
Se(x) = Selc) + (11) cn = —S,(c) — 2Kn (€), 
So(x) = S.(c), 


and we can conclude that x € C also in this case. 














Lemma 4.2. The code 
C = {x € Z} | S(x)=0 (mod q)} 
is an ST code. 


Proof. First we note that 
No(x) = So(x)=0 (mod 2) 


for all code words. We consider the various types of single errors that can 
occur. 


e If a substitution error change the parity of a symbol, then n,(c) is 

changed by one and becomes odd. 

e If a substitution error change an even element, say e; to another even 

element e’,, then Se(x) is changed to S_(x)—(—1)/(e;—e')) whereas So(x) 

and Kn (x) are unchanged. Hence $(x) is changed by (—1)/**(e;—e',) £0 

(mod q). 

e If a substitution error change an odd element to another odd element, 

the situation is similar. 

e If two elements of opposite partity are transposed, then K(x) is changed 
by one, whereas S.(x) and S,(x) are unchanged. Hence S(x) is changed 
by +1 £40 (mod q). 
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e If two even elements, ej and e;;1 are transposed, then Se(x) is changed 
by 


=(—1)? (ej — ej+1) + (—1)? (ej+1 — ej) = 2(—1)? (ej+1 — ey) 


whereas S,(x) and K,(x) are unchanged. Hence S(x) is changed by 
2(—1)9 (ej41 — ej). Since 0 < |ej41 — e;| < q = 2r and ej+1 — ej is even, 
we can conclude that e;+1—e; #0 (mod r) and so 2(—1)4(e;41—-e;) £0 
(mod q). 

e If two odd elements are transposed, the situation is similar. 














Since ČC is an ST code, its size is at most g”~% by Theorem 4.9. By 


Lemma 4.1, the size is exactly q”71. 


Verhoeff ’s code 


Another code construction for the case when q = 2r, r > 1 and odd, is 
based on so-called dihedral groups D,. This was first shown by Verhoeff 
for q = 10. We describe this code, but omit the proof that it is an ST code. 

One can map the elements of Ds onto Z19. The ”addition” table of i j 
is given in Table 4.1. An entry in the table gives i @ j where i is given in 
the left column and j in the top row. In particular, 160 = 061 = i for 
all 7, that is, O is the unit element of the group. The operation @ is not 
commutative. For example 165 = 6 and5@1= 9. We see that if ij = 0, 
then j 6i = 0, that is, j is the inverse of i in the group. 


Table 4.1 Addition table for ®. 


CONATARWNHO 
ONANTSCORWNH 
NATCMHORWN 
AntowenrInrowkw 
TOMNAWNHOK 
RWNHFOMMmANaAN 
WNRrFOKRMOAND 
NHFORWAUOAN 
HORWNAD CM © 0 
CRWNFANAMS 


For the construction, we also do a substitution in each position using 
Table 4.2 of functions fi : Zio — Zio where i are given in the left column 
and x in the top row. For i > 8, fi = fi_s. Hence, for example, fog = fo. 
The code is defined by 
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Table 4.2 Functions fi for Verhoeff’s construc- 
tion. 


fi(a) 
0 


© 
= 
N 
w 
iN 
on 
lon) 
NI 
o0 
© 


Nook WNH 
NNW BO CO Or OC 
ONNA OC} OF 
BO MOtrFONN 
DAWaAwawaw 
oOowlrFon ns 
FONNKB OOO 
WAWAWAW ED 
NOP OO Tr ON 
or ONN & © OC 
oneone oO 


{(£1,£2,..., En) | fi(£1) © fo(z2) ® +++ ® fn(@n) = OF. 


Analysis has shown that this code detects all single substitutions and all 
single transpositions. Also more than 95% of the twin errors are detected. 


Construction of a binary code 


It turns out that the maximal size of a binary ST code of length n is = ; 
We will not give the proof of this fact here, but give a code of this size. By 


Construction 2, the ternary code 
Toiora € Ze | ee =0 (mod 3)} 
i=1 
is an ST code. A binary subcode is the code 
{Cte tn) E Z} | en =0 (mod 3)). 
i=1 


As a subcode, this is also an ST code, and it can be shown that it has 


maximal size, that is |=]: 


4.3.2 ISBN 


A number of modifications of the constructions given in the previous sec- 
tion are used in practical systems. A typical example is the International 
Standard Book Number - ISBN. The traditional ISBN code is a ten-digit 
number. The first nine digits 71x72 ...29 codes information about the coun- 
try, publisher and the individual book. The last digit x19 is a check digit 
determined with check vector (10,9,...,1) modulo 11. This code can de- 
tect any single substitution or transposition error. Note, however, that 
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ws + we = 6 +5 = 0 (mod 11). Hence twin errors in positions 5 and 6 are 
never detected. A (digital) information vector (x1, 72,...,%9) € Z% deter- 
mines a check symbol x19 € Z11. If x19 = 10, the check symbol is written 
X. Some countries have chosen to avoid the symbol X by not using the 
information vectors that have X as check symbol. 

From January 1, 2007, a new ISBN code with 13 digits has been in- 
troduced. The first three digits y,y2y3 are 978 (later 979 will also be 
used), the next nine digits ysys...yi2 are now the information, and the 
last digit y13 is the check digit. The code is determined by the check vector 
(1,3,1,3,...,1,3,1) modulo 10. This choice of check digit is the same as 
for the bar code EAN described below. In fact, most books published in 
recent years contains both the ten digit ISBN number and a bar code with 
the new 13 digit ISBN. If you look at one, you will notice that the check 
digits are (usually) not the same. Since gcd(1,10) = gcd(3,10) = 1, this 
code can detect all single substitution errors. However, transposition of x; 
and x,;+1 will not be detected if 2; = zj +5 (mod 10). 


4.3.3 IBM code 


The IBM code is also known as the Luhn code after its inventor. It is 
usually presented as a code over Z10, but here we consider the more general 
alphabet of size q = 2r where r > 1 is odd. We gave two constructions for 
this alphabet size in the previous section. The IBM code is simpler, but it 
cannot detect all transpositions. 


For a sequence (#1, £2, .. ., &n—1) a check symbol zn is chosen such that 
n 
XO u=0 (mod q) 
i=1 
where 
Y= ti for i = n,n — 2,n — 4,...; 


Yi = 22; for i = n — 1,n — 3,n — 5,... and 0 < zt; Sr- 1, 

Yi = 2x; + 1 for i = n— 1,n — 3,n — 5,... and r < t; > q-—l1. 
Note that y; runs through Z, when x; does. Hence, any single substitution 
error can be detected. Consider a transposition of x; and x;4; where n — i 
is even. Then the sum )*""_, y; is changed by 
Zi + 2£ip1) + (£i+1 + 22i) = £i — Vig if z; < r and x41 <T, 
aps 


S 








( 
+ 2£i41 + 1) + (£i+1 + 2x) = Zi — Liy — l if xz; < r and x41 > T; 
t+ 20441) + (£i+1 + 2zi + 1) = Ti — i41 + 1 if x; > r and £i41 < T 
Vir 2£i+1 Bile 1) oF (£i+1 r 22; + 1) = Ti — V4+1 if Ti > r and Ti+1 > i 








T; 


S 


—( 
-( 
=| 
=I 
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We see that the change is not congruent zero modulo q, except in two cases, 
namely (xi, xi+1) = (0,q — 1) and (a, 2:41) = (q — 1,0). Hence, the code 
detects all transpositions, except transpositions of 0 and q — 1. 


4.3.4 Digital codes with two check digits 


In some applications there are other types of errors in addition to ST errors 
that are relatively frequent and therefore we want codes to detect such 
errors. 

We give one example modeled the Norwegian personal number 
codes. In Norway each person is assigned a unique 11 digit number 
P1P2P3P4P5P6P7Ps8P9Pi0P11 Where pip2p3papspe is the date of birth in the 
form ddmmyy, p7psp9 is the persons serial number and pjopi1 are check 
digits. In addition to simple substitutions, twin errors, and transposi- 
tions, common types of errors are interchanging date and month (i.e. 
pip2 + p3p4) and interchanging date and year (pipo © pspe, that is, 
ddmmyy => yymmdd). Also, interchanging month and year happens. The 
check digits are chosen using a weighted sum modulo 11, discarding serial 
numbers which produce 10 as value for one or both of the check digits. How 
may the weights be chosen? If we choose the weights 


109876543210 
oe) ee 


we know that all single substitutions, twin errors, and transpositions are 
detected. However, if pı + p2 = p3 + pa (mod 11) interchanging date and 
month will not be detected. We modify (4.21) as follows: Suppose that we 
multiply the first column of (4.21) by s and the third column by t to get 
10s98t76543210 
( sl ee 

As long as both s and t are non-zero modulo 11, we can still detect all single 
substitutions, twin errors, and transpositions (from one or the other of the 
check digits). When pip2 and psp4 are interchanged, the sum defining the 
first check digit is changed by 


—(10sp, + 9p2 + 8tp3 + 7p4) + (10sp3 + 9p4 + 8tpi + 7p2) 


= (10s — 8t)(p3 — pı) + 2(pa — p2). (4.22) 
Similarly, the sum defining the second check digit is changed by 


= (s — t)(p3 — p1). (4.23) 
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We want at least one of these to be non-zero (modulo 11) if pips 4 p3pa. 
This is guaranteed to be the case if s # t (mod 11). In this case, the sum 
defining the last check digit is non-zero unless pı = p3. On the other hand, 
if pı = p3, then po Æ p4 and the change in the first sum is non-zero. 

Similarly, if t # 1 (mod 11) we are guaranteed to detect any transpo- 
sition of month and year, and if s # 1 (mod 11) any transposition of date 
and year. For example, s = 3 and t = 2 gives the check matrix 


89576543210 
381211111111 


of a code which can detect all single substitution errors, twin errors, trans- 
positions, or the interchange of any two of the date, month, or year. 
The weights actually used in the Norwegian system are different, namely 


37618945210 
54327654321 


4.3.5 Barcodes 


There are a number of variants of barcodes. They usually have error detec- 
tion on two levels, both in the coding of the number itself with a check digit 
and in the coding of the individual digits as a sequence of bars. As an illus- 
tration, we consider the important European Article Numbering EAN-13. 
The check digits are the same as for the new ISBN, that is, the weights are 
(1,3,1,3,1,3,1,3,1,3,1,3,1) modulo 10. We denote the 13 digits number 
by (x1, T2, T3; , 013). 


Table 4.3 Symbol encoding for 
barcodes 


left A left B right 


0001101 0100111 1110010 
0011001 0110011 1100110 
0010011 0011011 1101100 
0111101 0100001 1000010 
0100011 0011101 1011100 
0110001 0111001 1001110 
0101111 0000101 1010000 
0111011 0010001 1000100 
0110111 0001001 1001000 
0001011 0010111 1110100 
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Each digit, except the first zı, are encoded into seven bars (black or 
white) of the same width. In this presentation, we represent the bars by 
1 (black) and 0 (white). The total barcode starts with lead sequence 101, 
then barcodes for the six digits x2, £3, £4, £5, £6, £7 in the left part (that is 
42 bars) follows, then a separator 01010, then the barcodes for the six digits 
£8, £9, £10, £11, 12, £13 in the right part, and finally a trailer sequence 101. 
The bars for the lead, the separator, and the trailer are longer than the 
others. 

The seven bars encoding a digit contains two runs of zeros and two runs 
of ones. The codes for the digits in the left part all starts with 0 and ends 
with 1. For the digits in the right part, the codes starts with 1 and ends 
with 0. The encoding is done using Table 4.3. 

For the six digits in the right part, the codes are listed under the heading 
*right”. For the six digits in the left part, the encoding is done either with 
the code listed under ”left A” or the one listed under ”left B”. The choice 
between the two is determined by x, using Table 4.4. 


Table 4.4 Choice of A or B. 


1 2 3 4 5 6 7 


8 
jet 
pe 
io) 
© 


rrrrrpe 
DOr ee 
Denner 
> Wupp 
wU eue 
Dernne 
>e Wue 
W> Weupe 
>Weee 
>Weee 


We illustrate with one example. 
Example 4.3. Suppose that 
T1T2... 12 = 978981270586. 
The check digit x13 is determined by 
£13 = —{9+3-7+8+3-94+8+3-1424+3-7+0+3-5+84+3-6} = —140 =0 (mod 10). 


The barcode for 9789812705860 is given in Table 4.5 over two lines and 
where the encoding is marked with A, B, and R (right). 
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Table 4.5 Barcode example. 


7 (A) 8 (B) 9 (B) 8 (A) 1 (B) 2 (A) 
101 0111011 0001001 0010111 0110111 0110011 0010011 01010 


7 (R) 0 (R) 5 (R) 8 (R) 6 (R) 0 (R) 
1000100 1110010 1001110 1001000 1010000 1110010 101 


4.4 Error detection for runlength-limited codes 


For this section we find it convenient to introduce a different notation for 
binary sequences. First, 0° denotes a sequence of a zeros. Any binary 
sequence of weight w, say, can then be represented as 


0910101. ..02-110%, (4.24) 
The binary sequence (4.24) is said to be a (d, k) constrained sequence if 
bo > 0, by > 0, and d < b; <kforl<i<w-1l 


where 0 < d < k < œœ (k = œ means that b; may be arbitrarily large). For 
example, 


02102104107101 = 00100100001000000010001 


is a (2,7) constrained sequence. Let Ryd, denote the set of all (d, k) 
constrained sequences of length n. A (d, k) run-length limited code of length 
n is some subset of Rn,a,k- Both the set of even weight code words in Rn,d,k 
and the set of odd weight code words in Rn,a,k are codes that clearly can 
detect single bit errors, and the larger of the two has size at least S#Rn,a, i 

Run-length limited codes have their main application in magnetic 
recording. One type of errors occurring are substitution errors. Another 
type of errors which is common in this context are shifts, that is, a one 
is moved one place to the left or to the right. In the terminology of the 
previous section, a shift is the same as a transposition. 


Construction of a non-systematic code 


The set Rn,a,k can be partitioned into three subcodes that can detect any 
single bit-error or shift. Hence there exists such a code of size at least 
4# Rn,d,k- The partition is as follows. 

For a sequence c represented as (4.24), let 3; be the parity of b;, that is 
3; = bi (mod 2) and 8; € {0,1}. Further, define a sequence 


S—1 = Q, 50, $1,.-- Sw 


176 Codes for Error Detection 


of numbers from {0, 1,2} by 


This is called the check sequence of c. Note that 2—8; #0 (mod 3). Hence, 
if si_, #;-1 (mod 3), and s; = ((2 — B;)si-1 + ĝi) (mod 3), then si Æ si 
(mod 3). The error check value of the sequence is defined by s(c) = sw. 
The couple of examples in Table 4.6 illustrate this. We see that s(c1) = 1 
and s(c2) = 0. 


Table 4.6 Examples of check values. 








cı = 0710102102 bb = 2, Bb, =1, be =3, 03=2, 
0, fi=1, B2=1, f3 =0, 


so = 0, sj =l, s2 = 2, s3 = 1, 


























c2 = 103101031 bo =0, bi=3, b2=1, b3=3, b4=0, 
8=0, 6=1, ß2=1, ß3=1, ĝß4=0, 


so = 0, si =l, s2 = 2, s3 = 0, s4 = 0. 




















Now, suppose 7 > 1. The jth run of zeros is preceded by a 1. Let 
the sequence obtained by changing this 1 into 0, be denoted by c’. This 
sequence has one run of zeros less than c since the (j — 1)th and jth runs 
now have been combined into one, and its parity is y = (6j;-1 + 1 + {;) 
(mod 2). Let sġ, {,---;$%,—1 be the check of c’. Then 


s; = si fori < j — 2, 


((2—7)sj-2 +7) (mod 3), 
((2 — 3;:)s;_» + Bi) (mod 3) for j < i < w. 


1 
J 


Si 
1 
Si—1 


1 


By (4.25) 


sj = ((2 — Bj)((2 — Bj-1)8j-2 + Bj-1) + Bj) (mod 3). 


The possible values of s; and s/,_, are given in Table 4.7. 


Table 4.7 Possible values of sj and Sha 





Bi Bii 7 sj = sh = (sj —s}_1) (mod 3) 
0 0 1 Sj—2 sj-2t+1 2 
O 1 0 2sj 2+2  2sj-2 2 
1 0 0 Based  2sj-2 1 
1 1 1 Sj—2 + 2 Sj—2 + 1 1 
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From Table 4.7 we see that 





8; — sj =2-—; #0 (mod 3). (4.26) 


Since 2 — 8; # 0 (mod 3) for all i, induction shows that s; # s/_, for 
t= j,j+1,...,w. Hence, s(c) Æ s(c’). 

Changing a 0 to a 1 is the opposite operation. Hence this will also 
change the check value. 

Finally, consider shifts. Let c” denote the sequence obtained by shifting 
the 1 preceding the jth run of zeros one step to the right. Then bf/_, = 
bj-1 +1 and bf = bj — 1. In particular, 67 = 1 — @;. If we remove this 1 
from c” we again obtain c’. Hence, by (4.26), 


si Sa = 2 (1 B;) = bj + 1. 


Combining this with (4.26), we get 





sj — 8; =1-—26;4#0 (mod 3). 


Again, induction shows that s(c”) 4 s(c). 

This shows that the set of all (d,k) constrained sequences of length n 
with a fixed check value is code that can detect any single bit-error or shift. 
Since we have three possible check values, this defines a partition of Rn,d,k 
into three such codes. 


Construction of a systematic code 


Let c be a (d, k) constrained sequence. We first split this into parts x; of 
length n, say, that is 


C = X1X2X3°°°. 
We want to find sequences y; for i = 1,2,3,... such that 


e The concatenated sequence x;y; has check value zero for all i. 
e The concatenated sequence X1y1X2y2X3y3 °°: is (d, k) constrained. 


Consider x;. Let u the length of the last run of zeros in x; and v the 
length of the first run of zeros in x;+1, that is 


x; =---10" and x41 =O0"1---. 
Then 


u>0,v>0,d<utv<k. 
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We first consider the simpler case k = oo. It can be shown that y; has to 
be of length at least d+2. We will describe how one can choose y; of length 
d+ 2. If u < d, we present three candidates for y;, namely 


20810 zo = 07H10", z3 = 0%. 


First we observe that all of them will make the concatenated sequence 
satisfy the (d, o0) constraint. We further observe that Z2 is obtained from 
Zı by a right shift of the 1, and zg is obtained from zı by changing the 1 
to 0. From the analysis of the non-systematic code above, this means that 
the three sequences X;Z1, XiZ2, and X;Z3 have distinct check values. Hence, 
one of Z1, Z2, Z3 can be chosen as y; to get the check value zero. 

If u > d, we again have three candidates for y;, namely 


LOT 010%, OTe: 


The analysis is similar to the previous case. 

For finite k > 2d + 3, it can be shown that y; has to be of length at 
least 2d +3. We will describe how one can choose y; of length 2d+ 3. The 
description is now split into four cases which cover all possibilities for u and 
v. The analysis is again similar and we just give the three candidates for 
yi in each case in Table 4.8. 


Table 4.8 Three candidates for y;. 


range candidates 

u<d 044109110" > GEWE TO o 024-4 +2104 
v<d<u 02?10410%+1=v ovio4+ijod=v» 0v%1024+2-v 
d<u<|k/2|],v>d_  10¢+110¢ 0107104 o4+2104 
d<vu<|k/2| <u 0710710 o710¢+11 0710¢+2 


4.5 Comments and references 


4.1. Theorem 4.2 is due to de Bruijn, van Ebbenhorst Tengbergen, and 
Kruyswijk (1951). 
The systematic perfect code given is due to Berger (1961). Freiman 
(1962) proved that no perfect systematic code has less redundancy. 
Theorem 4.1 is from Kløve and Korzhik (1995). Freiman suggested using 
the code with {a1,a2,... ap} = {|n/2] £i(a+1)|0<i<n/(2(a+1))} 
to detect up to a asymmetric errors. Note that Example 4.2 shows that 
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this construction may not be optimal. Freiman also has a suggestion for 
a systematic code to detect up to a asymmetric errors. A channel related 
to the Z-channel is the unidirectional channel. For this channel, if there 
are errors in a code word, they are all 0 — 1 errors or all 1 — 0 errors. A 
number of codes have been constructed for correction or detecting of such 
error-patterns. Also there are many constructions of codes correcting t 
symmetric errors (where t usually is 1) and detecting all unidirectional 
errors. For more information on such codes, we refer the reader to the 
books by Rao and Fujiwara (1989) and Blaum (1993). 

The Generalized Bose-Lin codes and their analysis were given by 
Gancheva and Klgve (2005b) who proved Theorems 4.3 and 4.4. For 
q = 2 (and w even) we get the codes given by Bose and Lin (1985). 
For w = 0 (and general q) we get the codes given by Bose and Prad- 
han (1982) (actually, Bose and Pradhan considered the codes with the 
smallest possible value of r, namely r = [log,((q¢—1)k+1)]). For w = 2 
(and general q) we get the codes studied by Bose, Elmougy, and Tallini 
(2005), El-Mougy (2005), El-Mougy and Gorshe (2005). 

The presentation of diversity combining is based on the paper by Kløve, 
Oprisan, and Bose (2005a). Their paper contains a number of additional 
results not presented in this book. 

4.2. The results are essentially due to Karpovsky and Taubin (2004) (with 
a different notation). See also Karpovsky and Nagvajara (1989). 

4.3. Theorem 4.9 and the various constructions in Section 4.3.1 are taken 
from AbdelGhaffar (1998). This paper also contains a good bibliogra- 
phy on ST codes. Another recommendable survey was given by Gallian 
(1996). 

Description of the various decimal codes and barcodes can be found many 
places on the web. As URLs often become obsolete, I do not include any 
here. 

The Verhoeff code was given by Verhoeff (1969). The use of various 
algebraic structures, e.g. quasigroups, for construction of single check 
digits have been studied by a number of authors since. We refer to the 
paper by Belyavskaya, Izbash, and Mullen (2005a) which contains a good 
bibliography for such codes. 

The Norwegian personal number codes were constructed by Selmer 
(1967). 

4.4. The non-systematic code was given by Perry (1995) and the systematic 
code by Perry, Li, Lin, and Zhang (1998). 
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AG(z), 22 

Ac(z), 13 

AG an: vim (C), 24 

A;(C), 13 

A; (C), 22 

a(x) = ac (S), 66 

AMDS, almost MDS code, 113 
ANSI code, 139 

asymmetric Channel, 153 
asymptotically bad class of code, 38 
asymptotically good class of codes, 38 
average, 66 


bad code for error detection, 38 
BCH code, 143 

binary symmetric channel, BSC, 129 
binomial moments, 16 

block synchronization, 115 


CC concatenated code, 9 
CCITT X.25 code, 139 
check matrix, 6 

check sequence, 176 
x(S), support, 2 

code, 1 

code word, 1 

combined correction and detection, 31 
complementary codes, 20 
concatenated code, 9 
concatenation, 3 

constant weight code, 149 


CRC, cyclic redundancy-check code, 
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cyclic code, 5, 114 

cyclic linear codes, 10 

cyclic redundancy-check code, 12 


d, minimum distance, 2 

d(C), minimum distance, 2 
defect, 112 

Delsarte-Goethals code, 146 

du, Hamming distance, 2 
distance distribution, 13 
distance distribution function, 13 
distance invariant, 21 


d(n, M;q), 2 
ô(n, R;q), 2 
ô(R;q), 2 


dual defect, 112 
dual distance, 13 


encoding, 1 

entropy function, 3 
equivalent code, 4 
equivalent linear code, 7 
error check value, 176 
even-weight subcode, 5, 137 
E(X), average, 66 
extended code, 4, 136 
extended Hamming code, 48 
extended linear code, 7 
extended vector, 3 
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Gaussian binomial coefficient, 71 
generalized Hamming weight, 33 
generalized Singleton bound, 30 
generalized weight distribution, 24 
generator matrix, 6 

generator polynomial, 10 

Golay [11 6;3] code, 111 

Golay [23 12;2] code, 111 

good for error detection, 38 


Hamming code, 7, 12, 44, 111 
Hamming distance, 2 
Hamming weight, 2 

H(z), entropy function, 3 


IBM-SDLC code, 139 

IEC TC57 code, 139 

IEEE WG77.1 code, 139 
irreducible cyclic codes, 115 
ISO 3309, 139 


Kerdock code, 146 


linear code, 6 
local symmetric channel, LSC, 118 


MacWilliams transform, 13 
MacWilliams’s theorem, 23 
MDS code, 112 

minimal polynomial, 143 
minimum distance, 2 
minimum support weight, 30 


new linear codes from old, 7 
[n, k,d;q], linear code, 2 

[n, k; q], linear code, 6 

(n, M,d;q), code, 2 

NMDS, near MDS code, 113 
(n, M; q), code, 1 

Nili, j), 27 


optimal code, 36 

optimal error detecting code, 36 
optimal linear code, 36, 132 
ordering vectors, 3 


parity check matrix, 6 

perfect code, 110 

Tue(n, R, p;q), 95 

Tue(R, p), 95 

Tue(R, p), 95 

Tue(R, p), 95 

Pless identities, 14 

Preparata code, 146 

primitive polynomial, 12 

probability of undetected error, 30, 36 

product code, 8, 102 

proper for error detection, 38 

Pae(C, p), average probability of 
undetected error, 66 

Pae(C, K), 30 

P(C, K), 31 

Pae(C, p), asymptotic bounds, 95 

Pae(C, p), upper bounds, 89 

Pae[n, k, piq], 84 

Pae|n, k, p; q], lower bounds, 84 

Pae(n, M, p; q), 84 

punctured code, 4, 136 

punctured linear code, 8 

puncturing, 4 

pure detection, 30 

Pywc(C, a,b), worst-case error 
probability, 79 


q-ary symmetric channel, qSC, 35 
qSC q-ary symmetric channel, 35 


rate, 2 

Reed-Muller code, 140 
repeated code, 9, 102 
repetition code, 110 


satisfactory for error detection, 38 
shortened code, 5, 101 

shortened linear code, 8 
shortening, 5 

simplex code, 7, 111 

single parity check code, 7, 111 
sphere, 27 

standard deviation o, 66 

star operation, 8, 98 


Sturm sequences, 47 

Si (x), sphere, 27 

support, 2 

symbol error probability, 35 
SYSL(n,k), 72 
SYSL(n,k,d), 73 

SYS (n,k), 72 

systematic code, 3 


tensor product code, 8 
threshold, 42 

trace function, 16 
transposition, 165 


ugly for error detection, 38 
undetectable error, 30 
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Var(C,p), 66 
variance, 66 
Var(X), 66 
vector, 1 


weight distribution, 22 

weight distribution function, 22 
weight distribution of cosets, 25 
weight hierarchy, 30 

wu, Hamming weight, 2 
worst-case error probability, 79 


Z-channel, 153 
Z4-linear code, 144 
zero-sum subcode, 5 


